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LEProOPTERAN GABA-GATED CHLORIDE CHANNELS 

The present invention relates to GABA-gated chloride channels from insects of 
5 the order lepidoptera, which are butterflies, moths and skippers that as adults have four 
broad or lanceolate wings. 

Gamma amino butyric acid (GABA) is the major inhibitory neurotransmitter in 
the insect CNS and periphery; modulating membrane potential through a GABA-gated 
chloride channel (Anthony et al., GABA receptor molecules of insects, Y. Pichon, ed., 

10 Birkhauser Verlag, Basel, Switzerland, 1993; Bloomquist, Am. Rev, Entomol 41: 

163-190, 1996; Hosie et al., Brit, J. Pharmacol 115: 909-912, 1995). The significance of 
this GABA-gated channel (i.e., GABA receptor) as the site of action for a number of 
conmiercial insecticides has been known since the 1960s, but attempts to isolate the gene 
have been frustrated by the low homology between the insect sequence and available 

15 vertebrate probes and a low transcript abundance (Darlison, Trends in Neur Sci, 15: 

469-474, 1992; flfrench-Constant, Insect Biochem. Molec. Biol 24: 335-345, 1994). More 
recently, a series of studies, directed by R. flfrench-Constant, utilized a conventional 
genetic approach that successfully located the gene (rdl) that determines resistance to 
dieldrin on the Drosophila polytene chromosome map (ffrench-Constant, Experimentia 

20 Supplementum. 63: 210-223, 1993; ffrench-Constant et al.. Nature 363: 449-451, 1993). 
Isolation and expression of Drosophila rdl has established its function as a GABA-gated 
chloride channel, though it has less than 35% homology to any of the subunits which 
constitute the functional analogue in vertebrates. 

Isolation of the Drosophila sequence has since been followed by full-length 

25 determinations of rdl-like GABA receptors from the mosquito Aedes aegypti as well as 
partial sequences from the flour beetle and a roach (Kaku and Matsumura, Comparative 
Biochemistry and Physiology C Pharmacology Toxicology and Endocrinology 108: 
367-376, 1994; Miyazaki et al., Comparative Biochemistry and Physiology 111, 399-406, 
1995; Thompson et al.. Insect Mol Biol 2: 149-154, 1993; Thompson et al., FEBS Letters 

30 325: 1 87-190, 1993). These gene determinations have allowed analyses to be conducted 
across several orders of insects showing that many species have adopted the same apparent 
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strategy for developing resistance to insecticides that act at the chloride channel; mutation 
of a critical alanine in the second transmembrane domain to a serine. Indeed, site-directed 
mutagenesis experiments in heterologous expression systems have shown that altering this 
single residue is sufficient to reduce insecticidal potency by three orders of magnitude 
5 (Cole et al., Life Sciences 56: 757-765, 1995; Hosie et al., Brain Res. 693: 257-260, 1995; 
Lee et ah, FEBS Letters 335: 351-318, 1993; Shotkoski et al., FEBSLett. 80: 257-262, 
1996). One of the more intriguing questions raised by the studies of resistance is why the 
mutation occurs at a low, but significant frequency in naive populations or in populations 
which have not been subjected to insecticide selection pressure in decades 

1 0 (ffrench-Constant, 1 994). 

Described herein are two lepidopteran receptor isoforms. In particular, these 
isoforms were isolated from the tobacco budworm (TB W) Heliothis virescens. One 
isoform, TBW-a3 has in the second transmembrane the motif ProAlaArgVal AlaLeu (or 
PARVAL) usually associated with dieldrin susceptibility, while the other, TBW-a2, has the 

1 5 motif ProAlaArgVaP'^SgrLeu (PARVSL, numbered as in SEQ ID 4) usually associated 
with dieldrin resistance. Genomic analysis reveals that both isoforms occur simultaneously 
in the same insecticide susceptible animals. Also described herein is a receptor isoform, 
TBW-al, that has an unprecedented motif of ProAlaArgValGlnLeu (or PARVQL). 
Summarv of the Invention 

20 In a first embodiment, the invention provides an isolated nucleic acid encoding a 

GABA-gated chloride channel comprising: 

(a) a nucleic acid including a sequence encoding a protein sequence of 
SEQ ID 3, SEQ ID 6 or SEQ ID 8, or a sequence having at least about 85% sequence 
identity with SEQ ID 3, SEQ ID 6 or SEQ ID 8; or 

25 (b) a nucleic acid that hybridizes with a nucleic acid encoding a protein 

sequence of SEQ ID 3, SEQ ID 6 or SEQ ID 8 or the complementary sequence to SEQ 
ID 3, SEQ ID 6 or SEQ ID 8, under stringent conditions; or 

(c) a nucleic acid that hybridizes with a nucleic acid having a sequence of 
SEQ ID 1, SEQ ID 4 or SEQ ID 7 or the complementary sequence to SEQ ID 1, SEQ ID 

30 4 or SEQ ID 7, under stringent conditions; or 
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(d) a nucleic acid has at least about 85 % sequence identity with Ihe 
coding region of SEQ ID 1, SEQ ID 4 or SEQ ID 7. 

In a second embodiment, the invention provides cells with the nucleic acid of the 
invention, which preferably express the channel at the cell surface. In another 
5 embodiment, the invention provides a process for producing a GABA-gated chloride 
protein in a cell of the invention, preferably by: growing the cell in a medium; and 
inducing the expression of the GABA-gated chloride channel by adding an expression 
inducing agent into the medium. The invention further provides the GABA-gated 
chloride channel, for instance as isolated from a cell of the invention. 

10 In another embodiment, the invention provides a method for characterizing a 

bioactive agent, the method comprising (a) providing a first assay composition 
comprising (i) a cell expressing a GABA-gated chloride channel or (ii) an isolated 
GABA-gated chloride channel comprising the amino acid sequence encoded by the 
nucleic acid of the vector, or the amino acid sequence resulting from cellular processing 

1 5 of the amino acid sequence encoded by the nucleic acid of the vector, (b) contacting the 
first assay composition with the bioactive agent or a prospective bioactive agent, and (c) 
measuring the binding of the bioactive agent or prospective bioactive agent or a cellular 
response mediated by a isolated GABA-gated chloride channel. 

The invention further provides hybridization probes that selectively hybridize 

20 with a nucleic acid of the invention, or the complementary sequence thereof The 

hybridization probe can be an amplification primer and the amplification conditions can 
be made sufficiently specific to amplify a GABA-gated chloride channel sequence from 
lepidoptera but not to amplify a GABA-gated chloride chaimel sequence from other 
insects such as Drosophila, Aedes, locust or beetle. 

25 Brief Description of the Drawings 

Figure 1 shows the sequences (nucleic acid [SEQ ID l]and protein [SEQ ID 2]) 
ofTBW-a2. 

Figure 2 shows the sequences (nucleic acid [SEQ ID 4] and protein [SEQ ID 5]) 
ofTBW-a3. 
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Figure 3 shows the sequences (nucleic acid [SEQ ID 7] and protein [SEQ ID 8]) 
ofTBW-al. 
Definitions 

For the purposes of this application, the following terms shall have the respective 
5 meanings set forth below. 
•Amplimer 

An amplimer is a nucleic acid which is an amplified copy of a sequence of another 
nucleic acid. Amplimers are typically produced by an amplification process such as the 
polymerase chain reaction. 

1 0 •Bioactive agent 

A bioactive agent is a substance such as a chemical that can act on a cell, virus, tissue, 
organ or organism to create a change in the functioning of the cell, virus, organ or 
organism. Preferably, the organism is an insect. In a preferred embodiment of the 
invention, the method of identifying bioactive agents of the invention is applied to 

1 5 organic molecules having molecular weight of about 1 ,000 or less. 
•Extrinsically-derived nucleic acid 

Extrinsically-derived nucleic acids are nucleic acids found in a cell that were introduced 
into the cell, a parent or ancestor of the cell, or a transgenic animal from which the cell is 
derived through a recombinant technology. 

20 •Promoter functionally associated with a nucleic acid 

An extrinsic promoter for a protein-encoding nucleic acid is a promoter distinct from that 
used in nature to express a nucleic acid for that protein. A promoter is functionally 
associated with the nucleic acid if in a cell that is compatible with the promoter the 
promoter can act to allow the transcription of the nucleic acid. 

25 •Prospective agent 

Prospective agents are substances which are being tested by the screening method of the 
invention to determine if they affect the function of a GABA-gated chloride channel. 
• Sequence identity 

"Identity," as known in the art, is a relationship between two or more polypeptide 
30 sequences or two or more polynucleotide sequences, as determined by comparing the 
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sequences, particularly, as determined by the match between strings of such sequences. 
"Identity" is readily calculated by known methods {Computational Molecular Biology, 
Lesk, A.M., ed., Oxford University Press, New York, 1988; Biocomputing: Informatics 
and Genome Projects, Smith, D.W., ed.. Academic Press, New York, 1993; Computer 
5 Analysis of Sequence Data, Part I, GrifFm, A.M., and Griffin, H.G., eds., Humana Press, 
New Jersey, 1994; Sequence Analysis in Molecular Biology, von Heinje, G., Academic 
Press, 1987; and Sequence Analysis Primer, Gribskov, M. and Devereux, J., eds., M 
Stockton Press, New York, 1991). While there exist a number of methods to measure 
identity between two sequences, the term is well known to skilled artisans (see, for 

1 0 example, Sequence Analysis in Molecular Biology, Sequence Analysis Primer, and 
Carillo, H., and Lipman, D., SIAM J, Applied MatK 48; 1073 (1988)). Methods 
commonly employed to determine identity between sequences include, but are not 
limited to those disclosed in Carillo, H., and Lipman, D., SIAM J. Applied Math, 
48; 1 073 (1 988) or Needleman and Wunsch, 1 Mol Biol , 48: 443-445, 1 970, or the 

15 Lipman-Pearson FASTA algorithm {Proc, Natl Acad ScL USA 85: 2444-2448, 1988). 
Computer programs for determining identity are publicly available. A preferred 
computer program for determining sequence identity is the program in Geneworks v 2.5 
(Intelligenetics Inc, Mountain View CA), which uses a progressive alignment procedure 
similar to FASTA. Preferably the parameters used with the Geneworks program are 

20 those which were the default parameters as of April 28, 1997, or, for proteins: cost to 
open gap = 50, lengthen gap = 100, minimum diagonal length = 4, maximum diagonal 
offset =125. Other computer program methods to determine identity between two 
sequences include, but are not limited to, GCG program package (Devereux, J., et al.. 
Nucleic Acids Research 12(1): 387 (1984)), BLASTP, BLASTN, and FASTA (Atschul, 

25 S.F, et al., J. Molec. Biol 215: 403-410 (1990)). The BLAST X program is publicly 
available from NCBI (blast@ncbi.nlm.nih.gov)and other sources (BLAST Manual, 
Altschul, S., et al, NCBI NLM NIH Bethesda, MD 20894; Altschul, S., et al,J. Mol 
Biol 215: 403-410(1990)). 
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Detailed Description of the Invention 

The present invention relates to a number of nucleic acid and protein sequences. 
For the gene TBW-a2: SEQ ID 1 is a cDNA sequence encompassing the open reading 
fiame; SEQ ID 2 is the protein encoded by SEQ ID 1 ; and SEQ ID 3 is the same protein 

5 minus the signal peptide. For the gene TBW-a3: SEQ ID 4 is a cDNA; SEQ ID 5 is the 
protein sequence encoded by SEQ ID 4; and SEQ ID 6 is the same protein minus the 
signal peptide. For the gene TBW-al : SEQ ID 7 is a cDNA sequence; and SEQ IDS is 
the protein sequence encoded by SEQ ID 7. The TBW-a2, TBW-a3 and TBW-al protein 
sequences are related to other insect GABA-gated chloride channels as set forth in the 

1 0 table below, where the relatedness values were determined using Geneworks v 2.5 
program with the following parameters: cost to open gap = 50, lengthen gap = 100, 
minimum diagonal length = 4, maximum diagonal offset =125. 





Percentage Identity 


Sequence: 


Drosophila 


TBW-a2 
(SEQ ID 2) 


TBW-a3 
(SEQ ID 5) 


Drosophila 
rdl' 


Aedes rdP 


28 


72 


74 


75 


Drosophila 
b 




32 


31 


28 


TBW-a2 
(SEQ ID 2) 






84 


64 


TBW-a3 
(SEQ ID 5) 








66 



^Genebank Accession No. M69057, ffrench et al., Proc, Natl Acad. ScL USA 88: 7209- 



15 7213, 1991. 

^Genebank Accession No. U28803, Thompson et al, FEBS Letters 325: 1 87-1 90, 1 993. 
^Genebank Accession No, LI 7436, Henderson et al., Biochem, Biophys. Res, Commun, 
193:474-482, 1993. 

20 In Figure 1, the apparent signal peptide is denoted with the three-letter amino acid 

code, while the remaining amino acid sequence is denoted with the one-letter code. The 
signal peptide was identified by an examination of the charge and polarity characteristics 
of the N-terminal portion, which examination shows the three domains (the n-region, 
h-region and c-region) typically associated with a signal peptide (Heijne and Abrahmsen, 
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FEES Letters 244: 439-446, 1989). Although the Ala Gly Ala sequence (amino acids 
30-32) just preceding a run of four glycines is in compliance with the (-3, -l)-rule for 
identifying a signal peptide cleavage site, a weighted matrix analysis did not strongly 
identify a specific signal peptide cleavage site (Heijne, J. Membr, Biol 115: 195-201, 
5 1986). In Figure 2, the apparent signal peptide of TBW-a3 is shown. The present 

invention further relates to isolated proteins in which these signal proteins are removed or 
substituted with another signal sequence. The substitution of one signal sequence with 
another and expression of the resulting proteins is illustrated, for the echistatin protein 
expressed in Sf5 cells, by Daugherty et al., DNA Cell Biol 9: 453-9, 1990. These authors 

10 also describe the use of computer-aided signal peptide selection. 

Further, in Figure 1, the dashed underlining in the region encoding TBW-a2 
indicates a "beta cysteine loop" that is characteristic of the ligand gated channel 
superfamily. The underlined four peptide sequences are apparent transmembrane segments. 
Also underlined is an AATAAA consensus polyadenylation/cleavage signal. In Figure 2, 

1 5 the first underlining in the region encoding TB W-a3 indicates the beta cysteine loop. The 
next four underlined peptide sequences are apparent transmembrane segments. 

By analogy to other GABA-gated chloride channels, it is likely that the functional 
protein is multimeric. See, e.g., Sitghs^ri, Pharmacol Reviews 41: 191-234, 1995. Insect 
channels are believed to typically be homooligomers. 

20 Nucleic Acid - encoding GABA-2ated chloride channel 

To construct non-naturally occurring GABA-gated chloride channel-encoding 
nucleic acids, the native sequences can be used as a starting point and modified to suit 
particular needs. For instance, the sequences can be mutated to incorporate useful 
restriction sites. See Maniatis et al. Molecular Cloning, a Laboratory Manual (Cold 

25 Spring Harbor Press, 1989). Such restriction sites can be used to create "cassettes", or 
regions of nucleic acid sequence that are facilely substituted using restriction enzymes 
and ligation reactions. The cassettes can be used to substitute synthetic sequences 
encoding mutated GABA-gated chloride channel amino acid sequences. Alternatively, 
the GABA-gated chloride channel-encoding sequence can be substantially or fully 

30 synthetic. See, for example, Goeddel et al., Proc. Natl Acad. Set USA, 76: 106-1 10, 
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1979. For recombinant expression purposes, codon usage preferences for the organism 
in which such a nucleic acid is to be expressed are advantageously considered in 
designing a synthetic GABA-gated chloride channel-encoding nucleic acid. For 
example, a nucleic acid sequence incorporating prokaryotic codon preferences can be 
5 designed from a eukaryotic-derived sequence using a software program such as Oligo-4, 
available from National Biosciences, Inc. (Plymouth, N4N). 

The nucleic acid sequence embodiments of the invention are preferably 
deoxyribonucleic acid sequences, preferably double-stranded deoxyribonucleic acid 
sequences. However, they can also be ribonucleic acid sequences, or nucleic acid 

1 0 mimics, meaning compounds designed to preserve the hydrogen bonding and base- 
pairing properties of nucleic acid, but which differ from natural nucleic acid in, for 
example, susceptibility to nucleases. 

Numerous methods are known to delete sequence from or mutate nucleic acid 
sequences that encode a protein and to confirm the function of the proteins encoded by 

1 5 these deleted or mutated sequences. Accordingly, the invention also relates to a mutated 
or deleted version of a nucleic acid sequence that encodes a protein that retains the ability 
to transport chloride across a membrane, especially if such transport is turned on or 
enhanced by the presence of GAB A. These analogs can have N-terminal, C-terminal or 
internal deletions or substitutions, so long as GABA-gated chloride channel function is 

20 retained. The point variations are preferably conservative point variations. Preferably, 
the analogs will have at least about 85% sequence identity, preferably at least about 90%, 
more preferably at least about 95% still more preferably at least about 98% yet still more 
preferably at least about 99.5%, to the corresponding protein sequence of SEQ ID 3, SEQ 
ID 6 or SEQ ID 8. 

25 Mutational and deletional approaches can be applied to all of the nucleic acid 

sequences of the invention that express GABA-gated chloride channel proteins. As 
discussed above, conservative mutations are preferred. Such conservative mutations 
include mutations that switch one amino acid for another within one of the following 
groups: 
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1 . Small aliphatic, nonpolar or slightly polar residues: Ala, Ser, Thr, Pro and 

Gly; 

2. Polar, negatively charged residues and their amides: Asp, Asn, Glu and 

Gin; 

5 3, Polar, positively charged residues: His, Arg and Lys; 

4. Large aliphatic, nonpolar residues: Met, Leu, He, Val and Cys; and 

5. Aromatic residues: Phe, Tyr and Trp. 

A preferred listing of conservative variations is the following 



viriuinai ivesiuuc 


V <iriaiiuii 


Ala 


Gly, Ser 


Arg 


Lys 


Asn 


Gin, His 


Asp 


Glu 


Cys 


Ser 


Gin 


Asn 


Glu 


Asp 


Gly 


Ala, Pro 


His 


Asn, Gin 


He 


Leu, Val 


Leu 


He, Val 


Lys 


Arg, Gin, Glu 


Met 


Leu, Tyr, He 


Phe 


Met, Leu, Tyr 


Ser 


Thr 


Thr 


Ser 


Trp 


Try 


Tyr 


Trp, Phe 


Val 


He, Leu 



10 The types of variations selected may be based on the analysis of the frequencies of amino 
acid variations between homologous proteins of different species developed by Schulz et 
al.. Principles of Protein Structure, Springer-Verlag, 1978, on the analyses of structure- 
forming potentials developed by Chou and Fasman, Biochemistry 13: 21 1, 1974 and Adv. 
EnzymoL, 47: 45-149, 1978, and on the analysis of hydrophobicity patterns in proteins 

15 developed by Eisenberg et al, Proc, Natl, Acad Sci, USA 81: 140-144, 1984; Kyte & 
Doolittle; J. Molec, Biol 157: 105-132, 1981, and Goldman et sA.^Ann, Rev. Biophys. 
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Chem, 15: 321-353, 1986. All of the references of this paragraph are incorporated herein 
in their entirety by reference. 

For the purposes of this application, a nucleic acid of the invention is "isolated" if 
it has been separated from other macromolecules of the cell or tissue from which it is 
5 derived. 

Nucleic Acid Probes 

It will be recognized that many deletional or mutational analogs of nucleic acid 
sequences for a GABA-gated chloride chaimel will be effective hybridization probes for 
GABA-gated chloride channel-encoding nucleic acid. Accordingly, the invention relates 

1 0 to nucleic acid sequences that hybridize with such GABA-gated chloride channel- 
encoding nucleic acid sequences under selection conditions. Preferably, the nucleic acid 
sequence selects for the nucleic acid sequence encoding SEQ ID 3, SEQ ID 6 or SEQ ID 
8. Probing can comprise, for example, hybridization, Rnase protection or amplification. 
"Selective conditions" refers to conditions that allow for the identification of 

15 substantially related nucleic acid sequences; and, in this context, refers to conditions that 
distinguish GABA-gated chloride channel from the reported Drosophila, beetle and roach 
rdl or rdl-related genes. For instance, for hybridization such conditions are stringent 
conditions that will generally allow hybridization of sequence with at least about 85% 
sequence identity, preferably with at least about 90% sequence identity, more preferably 

20 with at least about 95% sequence identity, for example with a nucleic acid encoding SEQ 
ID 3, SEQ ID 6 or SEQ ID 8. Such hybridization conditions are described by Sambrook 
et al.. Molecular Cloning: A Laboratory Manual, 2nd ed., Cold Spring Harbor Press, 
1989. For exaniple, such conditions include ovemight incubation at 42''C in a solution 
comprising: 50% formamide, 5xSSC (150mM NaCl, 15mM trisodium citrate), 50 mM 

25 sodium phosphate (pH7.6), 5x Denhardt's solution, 1 0% dextran sulfate, and 20 

micrograms/ml denatured, sheared salmon sperm DNA, followed by washing the filters 
in O.lx SSC at about 65''C. Hybridization and wash conditions are well known and 
exemplified in Sambrook, et a/.. Molecular Cloning: A Laboratory Manual, Second 
Edition, Cold Spring Harbor, N.Y., (1989), particularly Chapter 1 1 therein, the disclosure 

30 of which is hereby incorporated in its entirety by reference. Hybridization conditions and 
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probes can be adjusted in well-characterized ways to achieve selective hybridization of 
probes. 

Nucleic acid molecules that will hybridize to a GAB A-gated chloride channel- 
encoding nucleic acid under stringent conditions can be identified functionally, using 
5 methods outlined above, or by using for example the hybridization rules reviewed in 
Sambrook et al.. Molecular Cloning: A Laboratory Manual, 2nd ed.. Cold Spring Harbor 
Press, 1989. 

Without limitation, examples of the uses for nucleic acid probes include: 
histochemical uses such as identifying tissues that express the GABA-gated chloride 

10 channel; measuring mRNA levels, for instance to identify a sample's tissue type or to 
identify cells that express abnormal levels of GABA-gated chloride channel; and 
detecting polymorphisms in the GABA-gated chloride channel gene. RNA hybridization 
procedures are described in Maniatis et al. Molecular Cloning, a Laboratory Manual 
(Cold Spring Harbor Press, 1989). 

15 Amplification Primers 

Rules for designing polymerase chain reaction ("PGR") primers are now 
established, as reviewed by PCR Protocols, Cold Spring Harbor Press, 1991. Degenerate 
primers, i.e., preparations of primers that are heterogeneous at given sequence locations, 
can be designed to amplify nucleic acid sequences that are highly homologous to, but not 

20 identical to, a GABA receptor-encoding nucleic acid. Strategies are now available that 
allow for only one of the primers to be required to specifically hybridize with a known 
sequence. See, Froman et al, Proc. Natl Acad ScL USA 85: 8998, 1988 and Loh et al. 
Science 243: 217, 1989. For example, appropriate nucleic acid primers can be ligated to 
the nucleic acid sought to be amplified to provide the hybridization partner for one of the 

25 primers. In this way, only one of the primers need be based on the sequence of the 
nucleic acid sought to be amplified. 

PCR methods of amplifying nucleic acid will utilize at least two primers. One of 
these primers will be capable of hybridizing to a first strand of the nucleic acid to be 
amplified and of priming nucleic acid synthesis in a first direction. The other will be 

30 capable of hybridizing the reciprocal sequence of the first strand (if the sequence to be 
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amplified is single stranded, this sequence will initially be hypothetical, but will be 
synthesized in the first amplification cycle) and of priming nucleic acid synthesis fi-om 
that strand in the direction opposite the first direction and towards the site of 
hybridization for the first primer. Conditions for conducting such amplifications, 
5 particularly under preferred stringent hybridization conditions, are well known. See, for 
example, PCR Protocols, Cold Spring Harbor Press, 1991. 

Other amplification procedures are available that utilize oligonucleotides to direct 
the specificity of the amplification, such as the ligase chain reaction (LCR). LCR uses 
the source nucleic acid as a template to bring two probe oligonucleotides close to one 

1 0 another to allow ligation (with or without provision for polymerization to fill in relatively 
small gaps between the probes). Upon ligation, the two linked probes provide additional 
template for the next cycle of the reaction. As with PCR, approaches can be devised to 
use a single probe corresponding to the source nucleic acid. The present invention also 
encompasses oligonucleotides designed to specifically identify GABA-gated chloride 

15 channels. 
Vectors 

A suitable expression vector is capable of fostering expression of the included 
GABA-gated chloride channel encoding DNA in a host cell, which can be eukaryotic, 
fungal, or prokaryotic. Suitable expression vectors include pRc/CMV (Invitrogen, San 

20 Diego, CA), pRc/RSV (Invitrogen), pcDNA3 (Invitrogen), Zap Express Vector 

(Stratagene Cloning Systems, LaJolla, CA); pBk/CMV or pBk-RSV vectors (Stratagene), 
Bluescript II SK +/- Phagemid Vectors (Stratagene), LacSwitch (Stratagene), pMAM and 
pMAM neo (Clontech, Palo Alto, CA), pKSVlO (Pharmacia, Piscataway, NJ), pCRscript 
(Stratagene) and pCR2.1 (Invitrogen), among others. Useful yeast expression systems 

25 include, for example, pYEUra3 (Clontech). Useful baculovirus vectors include several 
viral vectors from Invitrogen (San Diego, CA) such as pVL1393, pVL1392, pBluBac2, 
pBluBacHis A, B or C, and pbacPAC6 (from Clontech). A preferred vector is any of the 
pIEl -series vectors (Novagen, Madison WI) utilizing the EIP (early inducible promoter) 
baculovirus promoters for expression in Sf9 or Sf21 cells. Of course, expression can 
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simply comprise expression of in vitro produced RNA in a cell or a cell-free system. In 

some embodiments, inducible promoters are preferred. 

Cells 

In one embodiment of the invention, the channel is expressed in a eukaryotic cell 

5 line, preferably a transformed cell line with an established cell culture history. In this 
embodiment, particularly preferred cell lines include lepidopteran cells such as Sf9 and 
S£21 cells (available for example from Clontech, Palo Alto, CA) and Drosophila cells 
such as Schneider-2 or Kc cells. Other useful cells include mammalian cells such as 
COS or CHO cells, fungal cells such as yeast cells, and bacterial cells. Considerations 

10 for expressing membrane-bound receptors in bacteria can be found in Freissmuth et al., 
"Expression of two human beta-adrenergic receptors in Escherichia coli," Proc, Natl 
Acad. Sci. USA 88: 8548-8552, 1991 and Herzog et al., "Human neuropeptide Yl 
receptor expressed in Escherichia coli retains its pharmacological properties,*' DNA Cell 
Biol 13: 1221-1225, 1994. 

15 Isolated GABA-2ated chloride channel 

The invention also provides for the GABA-gated chloride channel proteins 
encoded by any of the nucleic acids of the invention preferably in a purity achieved, for 
example, by applying protein purification methods, such as those described below, to a 
lysate of a recombinant cell according to the invention. 

20 The GABA-gated chloride channel variants of the above paragraphs can be used 

to create organisms or cells that produce GABA-gated chloride channel activity. 
Purification methods, including associated molecular biology methods, are described 
below. 

Method of Producing GABA-sated chloride channel 

25 One simplified method of isolating polypeptides synthesized by an organism 

under the direction of one of the nucleic acids of the invention is to recombinantly 
express a fusion protein wherein the fusion partner is facilely affinity purified. For 
instance, the fusion partner can be glutathione S-transferase, which is encoded on 
commercial expression vectors (e.g., vector pGEX4T3, available from Pharmacia, 

30 Piscataway, NJ). The fusion protein can then be purified on a glutathione affinity 
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column (for instance, that available from Pharmacia, Piscataway, New Jersey). Of 
course, the recombinant polypeptides can be affinity purified without such a fusion 
partner using an appropriate antibody that binds to GABA-gated chloride channel. 
Methods of producing such antibodies are available to those of ordinary skill in light of 
5 the ample description herein of GABA-gated chloride channel expression systems and 
known antibody production methods. See, for example, Ausubel et al., Short Protocols 
in Molecular Biology, John Wiley & Sons, New York, 1992. If fusion proteins are used, 
the fusion partner can be removed by partial proteolytic digestion approaches that 
preferentially attack unstructured regions such as the linkers between the fusion partner 

10 and GABA-gated chloride channel. The linkers can be designed to lack structure, for 

instance using the rules for secondary structure forming potential developed, for instance, 
by Chou and Fasman, Biochemistry 13: 21 1, 1974 and Chou and Fasman, Adv. in 
Enzymol 47: 45-147, 1978. The linker can also be designed to incorporate protease 
target amino acids, such as, arginine and lysine residues, the amino acids that define the 

15 sites cleaved by trypsin. To create the linkers, standard synthetic approaches for making 
oligonucleotides can be employed together with standard subcloning methodologies. 
Other fusion partners besides GST can be used. Procedures that utilize eukaryotic cells, 
particularly manunalian cells, are preferred since these cells will post-translationally 
modify the protein to create molecules highly similar to or functionally identical to 

20 native proteins. 

Additional purification techniques can be applied, including without limitation, 
preparative electrophoresis, FPLC (Pharmacia, Uppsala, Sweden), HPLC (e.g., using gel 
filtration, reverse-phase or mildly hydrophobic columns), gel filtration, differential 
precipitation (for instance, "salting out" precipitations), ion-exchange chromatography 

25 and affinity chromatography. 

Because GABA-gated chloride channel is a membrane protein, which by analogy 
to related channel proteins is believed to have four transmembrane sequences, isolation 
methods will often utilize detergent extractions, generally using detergents such as non- 
ionic detergents selected to maintain the appropriate secondary and tertiary structure of 

30 the protein. See, for example, Hjelmeland, "Solubilization of native membrane 
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proteins " in Methods in Enzymol, Vol. 182, M.P. Deutscher, ed., Academic Press, San 
Diego, CA, pp. 253-264, 1990 and Thomas and McNamee, "Purification of membrane 
proteins," in Methods in Enzymol, Vol. 182, pp. 499-520, 1990. For a description of 
methods for re-integrating a solubilized channel into a membrane, see Ohta et al., 
5 "Dynamic structures of adrenocortical cytochrome P-450 in proteoliposomes and 

microsomes: protein rotation study," Biochemistry 31 : 12680-7, 1992 and Krishnaswamy 
et al., "Role of the membrane surface in the activation of human coagulation factor X," J. 
Biol Chem. 261: 261 10-20, 1992. Integral proteins typically have at least one domain 
that extends away from the cell surface or other membrane. 

1 0 The isolation of GABA-gated chloride channel can comprise isolating 

membranes from cells that have been transformed to express GABA-gated chloride 
channel. Preferably, such cells express GABA-gated chloride channel in sufficient copy 
number such that the amount of GABA-gated chloride channel in a membrane fraction is 
at least about 1 0-fold higher than that found in comparable membranes from cells that 

1 5 naturally express GABA-gated chloride channel, more preferably the amount is at least 
about 100-fold, or still more preferably at least about 1000- fold, higher. If needed, 
specific membrane fractions, such as a plasma membrane fraction, can be isolated. 

For the purposes of this application, GABA-gated chloride channel is "isolated" if 
it has been separated from other proteins or other macromolecules of the cell or tissue 

20 from which it is derived. Preferably, the composition containing GABA-gated chloride 
channel is at least about 10-fold enriched, preferably at least about 100-fold, with respect 
to protein content, over the composition of the source cells. 
Method of Characterizins or Identiivins asent 

A method for the analysis of or screening for a bioactive agent, for instance for 

25 use as an insecticide, comprises determining some measure of activity of a bioactive 

agent or a prospective bioactive agent mediated by a recombinant GABA-gated chloride 
charmel. This determining can include culturing multiple (two or more) cell cultures, 
wherein the cultures are preferably of the same species, more preferably of the same 
strain or cellular subtype thereof, and the cells of each culture includes an nucleic acid 

30 encoding a recombinant GABA-gated chloride channel as described herein. At least 
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proteins " in Methods in Enzymol, Vol. 182, M.P. Deutscher, ed., Academic Press, San 
Diego, CA, pp. 253-264, 1990 and Thomas and McNamee, "Purification of membrane 
proteins," in Methods in EnzymoL , Vol. 1 82, pp. 499-520, 1 990. For a description of 
methods for re-integrating a solubiiized channel into a membrane, see Ohta et al., 
5 "Dynamic structures of adrenocortical cytochrome P-450 in proteoliposomes and 

microsomes: protein rotation study," Biochemistry 31: 12680-7, 1992 and Krishnaswamy 
et al, "Role of the membrane surface in the activation of human coagulation factor X," 1 
Biol. Chem. 267: 2611 0-20, 1992. Integral proteins typically have at least one domain 
that extends away from the cell surface or other membrane. 

1 0 The isolation of GABA-gated chloride channel can comprise isolating 

membranes from cells that have been transformed to express GABA-gated chloride 
channel. Preferably, such cells express GABA-gated chloride channel in sufficient copy 
number such that the amount of GABA-gated chloride channel in a membrane fraction is 
at least about 1 0-fold higher than that found in comparable membranes from cells that 

1 5 naturally express GABA-gated chloride channel, more preferably the amount is at least 
about 100-fold, or still more preferably at least about 1000-fold, higher. If needed, 
specific membrane fractions, such as a plasma membrane fraction, can be isolated. 

For the purposes of this application, GABA-gated chloride channel is "isolated" if 
it has been separated from other proteins or other macromolecules of the cell or tissue 

20 from which it is derived. Preferably, the composition containing GABA-gated chloride 
channel is at least about 10-fold enriched, preferably at least about 100-fold, with respect 
to protein content, over the composition of the source cells. 
Method of Characterizin2 or Identifying asent 

A method for the analysis of or screening for a bioactive agent, for instance for 

25 use as an insecticide, comprises determining some measure of activity of a bioactive 

agent or a prospective bioactive agent mediated by a recombinant GABA-gated chloride 
channel. This determining can include culturing multiple (two or more) cell cultureis, 
wherein the cultures are preferably of the same species, more preferably of the same 
strain or cellular subtype thereof, and the cells of each culture includes an nucleic acid 

30 encoding a recombinant GABA-gated chloride channel as described herein. At least 
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some of the cultures are contacted with bioactive agent or prospective bioactive agent, 
while controls are treated in parallel except that they are not contacted with bioactive 
agent or prospective bioactive agent. Binding activities can be identified, or other 
cellular responses, such as chloride conductance, can be monitored to provide an 
5 indication of whether a bioactive agent or prospective bioactive agent acts on the cells 
and is an agonist or antagonist. Preferably, cells are contacted with a bioactive agent or 
prospective agent that is an organic compound. Binding of a bioactive agent or 
prospective bioactive agent can be determined directly, in which case the prospective 
agent usually incorporates a radioisotope, such as or ^ ^C, or through competition 

1 0 with a labeled, known ligand. The results are then compared to results with cells that 
were not contacted with the bioactive agent or prospective bioactive agent (i.e., the 
control cell). Alternatively, particularly for binding assays, an assay can utilize a 
composition comprising an isolated GABA-gated chloride channel in place of cells. 
A ligand used in a binding assay is preferably radiolabeled with any detectable 

1 5 isotope, such as radioactive isotopes of iodide, carbon or hydrogen. Specific binding of 
the radiolabeled ligand is then determined by subtracting the radioactivity due to non- 
specific binding fi^om that which is due to total (/.e., specific and non-specific) binding of 
the radiolabeled ligand. The radioactivity due to non-specific binding is determined by 
measuring the amount of radiolabel associated with a GABA-gated chloride channel that 

20 has been contacted with both radiolabeled ligand and a significant excess of non- 

radiolabeled ligand, such as a 1,000-fold excess. The radioactivity due to total binding of 
the radiolabeled ligand is determined by measuring the amount of radiolabel boimd to the 
receptor preparation in the absence of non-radiolabeled ligand. 

A bioactive agent that affects a GABA-gated chloride channel of the invention 

25 can have a contrasting activity profile with respect to another GABA-gated chloride 

channel. In one embodiment, a preferred bioactive agent has specificity to bind GABA- 
gated chloride channel of the invention with at least about 50-fold greater affinity than its 
binding to one other GABA-gated chloride channel, more preferably at least about 500- 
fold greater affinity. The bioactive agent can be any compound, material, composition, 

30 mixture, or chemical, that can be presented to a receptor in a form that allows for the 
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agent to diffuse so as to contact the receptor. Other suitable bioactive agents in the 
context of the present invention include small organic compounds, preferably of 
molecular weight between about 100 daltons and about 1,000 daltons, and are composed 
of such functionalities as alkyl, aryl, alkene, alkyne, halo, cyano and other groups, 
5 including heteroatoms or not. The chemicals tested as prospective agents can be 

prepared using combinatorial chemical processes known in the art or conventional means 
for chemical synthesis. 

Additional indicators of cellular responses to a bioactive agent include, for 
example: flux of radioactive [^*C1] chloride ions; chlorine sensitive fluorescent probes 
10 (e.g. SPQ (6-methoxy-N-(3-sulfopropyl)quinolinium)); or changes in intercellular 

membrane potential measured by electrophysiological methods such as the patch clamp 
or with redox sensitive dyes such as acridine orange. 

The following examples further illustrate the present invention, but of course, 
should not be construed as in any way limiting its scope. 
15 Example 1 - GABA-gated Chloride Channel Sequence Identifications 
Materials and Methods. 

TBW polyA RNA isolation. DEPC was prepared to 0.1% in water, incubating at 
37C overnight, then autoclaving for 60 minutes. All glassware was baked for 4h at 
250*^0, bottle caps were soaked in 0. 1% DEPC. The microprobe of a Braun homogenizer 

20 was soaked in 50 mis 100% EtOH, then run in 25 mis RNAzoIB (a guanidinium 

hydrochloride preparation from CINNA-BIOTECX Labs, Inc., Houston, TX). Fourth 
instar TBW larvae were frozen in weigh boats placed on dry ice, heads were excised with 
razor blades and sets of 100 heads were collected in round bottom centrifuge tubes. The 
excised heads were homogenized at full speed for 30 s at room temperature. Extraction 

25 buffer (3 ml) from a Pharmacia Biotech QuickPrep Micro mRNA Purification kit 

(Pharmacia Biotech Inc., Piscataway, NJ) was added and homogenization was continued 
for 10 s. The macerate was clarified by centrifugation at 12000g in an SS34 rotor for 10 
minutes at RT. The supernatant was batch processed on oligo-dT spin columns from 
PMK as specified by the manufacturer. Three elutions totaling 1.5 ml were pooled and 

30 the RNA quantified by UV spectrometry. 
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Synthesis of first strand cDNA. Reverse transcription was initiated by addition 
of cloned Maloney Murine Leukemia Virus (M-MLV) reverse transcriptase to 0.5 
template RNA in the presence of all four dNTPs. The reactions were placed on a 
Geneamp 9600 (Perkin-Elmer-ABI, Foster City, CA) thermal cycler and held at 42*^0 for 
5 30 min; the M-MLV RT was then inactivated by heating to 99^*0 for 5 min followed by 5 
min at 5°C. 

PCR amplification. The 20 \i\ cDNA reaction was made to 100 ^il utilizing 
buffers and dNTPs supplied in a Perkin Elmer, Amplitaq based RT-PCR kit according to 
the manufacturers protocol. Amplifications utilizing degenerate primers typically 

1 0 employed annealing temperatures of 45 - 48*^0, those involving isoform specific primers 
used annealing temperatures in the range of 55 to 60°C. RACE reactions were carried 
out using primers and protocols supplied with the GIBCO BRL 5' and 3' RACE kits 
(GIBCO-BRL, Bethesda, MD). The PCR products were characterized by agarose gel 
electrophoresis. When secondary "nested" amplifications were carried out, bands were 

15 excised from NuSieve gels (FMC Corp., ) and remelted by heating to lO^'C. The molten 
agarose was diluted 1 : 1 with warm water and a 1 :5 |il aliquot was transferred directly to 
a second 100 ^1 amplification. 

Genomic DNA Isolations. Genomic DNA was isolated and purified from 10 to 
20 TBW larvae with reagents and protocols provided in a Pharmacia Biotech RapidPrep 

20 Macro Genomic DNA kit (Pharmacia, Piscataway, NJ), genomic isolations from 

individual larvae were made with the micro version of the same system. Amplifications 
using anchor adaptor ligated genomic DNA as template followed the strategy outlined by 
Roux et al. {BioTechniques 8: 48-57, 1990). 

Primer synthesis and design. Oligonucleotides were synthesized on an ABI 

25 model 392 DNA synthesizer (Perkin-Elmer-ABI, Foster City, CA) using reagents and 
procedures supplied by the manufacturer. The reaction products were isolated on 
ABI/PE OPC columns (Perkin-Elmer-ABI, Foster City, CA) and used without further 
purification. Biotinylated sequencing primers were made using the fifth bottle position 
on the synthesizer. PCR primers and probes were designed and annealing temperatures 

30 estimated using the OLIGO 4.0 program fi-om NBI Scientific Software (Plymouth, MN). 
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Subcloning and sequencing. Proteins were removed from PGR reactions by 
three extractions with Strataclean resin as specified by Stratagene Corp. (La JoUa, CA). 
If the primers included engineered restriction sites, they were then digested. More 
routinely, the amplimers were blunt ended by filling with Klenow polymerase treatment, 
5 then phosphorylated by routine procedures (Sambrook et al., 1989). The amplimers were 
then gel purified on Seaplaque or NuSieve gels (FMC Corp) and extracted from the 
agarose using a QIAEX kit (QIAGEN Corp., Chatsworth, CA). Alkaline lysis plasmid 
isolations and purifications were carried out with a Qiatip kit following the 
recommendations of QIAGEN Corp. Thermal cycle sequencing reactions utilized 5'-end 

1 0 labeled biotinylated sequencing primers and a Promega ftnol sequencing kit (Promega, 
Madison, WI). Reactions products were separated on 6% Long Ranger (FMC Corp) 7M 
Urea manual gels, then the biotinylated ladders were transferred to Immobilon (Millipore 
Corp., Bedford, MA) membranes and developed with a Phototope chemiluminesent kit 
following protocols developed by New England Biolabs (Beverly, MA). Alternatively, 

1 5 dye terminator cycling reactions were carried out with a Perkin Elmer Amplitaq FS 

sequencing kit and the reaction products were analyzed on 5% Long Ranger gels run in 
an ABI Prizm 377 automated DNA sequencer (Perkin-Elmer-ABI, Foster City, CA). 
Five to 10 clones carrying a PCR reaction product were sequenced in both directions 
until a consensus could be achieved between multiple clones as a means of avoiding 

20 errors in nucleotide assignments due to thermal polymerase mis-incorporations. 
Sequencing contigs were assembled using the Intelligenetics Gene Works program 
(Intelligenetics, Mountain View, CA). 



Primers. The primers utilized were as follows: 



Primer 


Sequence 


Trans- 
lation 


Orienta- 
tion 


1(#9)* 


GCRAANACCATNACRAARCA 




reverse 


5(#10) 


GTNGTCATNGTSAGNACNGT 




reverse 


6(#11) 


TGGGTNCCNGAYACNIT 


WVPDTF 


forward 


7 (#12) 


CCGAGCTCSWRTAYTTRTCDATRTC 




reverse 


8 (#13) 


CCGAGCTCARRTADATDATCCARTACAT 




reverse 


9(#14) 


AGGCGGCCGCGGNGTNACNATGTAYGT 


GVTMYV 


forward 
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A riuicr 




Trans- 
lation 


Orienta- 
tion 


10(#15) 


CTGCGGCCGCCARTTYTGGACNGAYCC 


QFWIDP 


forward 


11 (#16) 


AATCTAGAGGGTGTCTTTCTGGTTG 


VSFWL 


forward 


12 (#17) 


AGCTCGAGAGTTTCGGCTACACCAT 


SFGYTM 


forward 


13 (#18) 


TTCTCGAGCGATGGATTTGCACTATTTTC 


MDLQYF 


forward 


14 (#19) 


CAGAGCTCATTTCACATGCCAGACGAGAG 




reverse 


15(#20) 


TAGAGCTCGAATGATGAATGCGTATGAAT 


FIRIHH 


forward 


16(#21) 


TCTCTAGATACGCTCGATGGGATAC 


RSMGY 


forward 


24 (#22) 


TTGCGGCCGCCATATATCCCACAG 




reverse 


25(#23) 


CTGCGGCCGCTCGAGCTGGTG 




reverse 


26 (#24 ) 


CGGATGAATTCATTGCTGGTlbi i 




reverse 




P m rp p r; 7\ rp p p 7\ T P n A A G T AT T G 




reverse 


31 (#26) 


GCGGACCTCCATAGTTTGGTC 




reverse 


34 (#27) 


CAGACGAAGAAGCTGGACCACCTC 


DEEAGP- 
PP 


forward 


35{#28) 


ACGCGGCCGCAAGGACATAAGCAA 


KDISK 


forward 



*The SEQ ID numbers are in parentheses. 

The degenerate primers among the above oligonucleotides incorporate a statistical mix 
monomers at the positions labeled N (A, G, C or T), H (A, C or T), S (C or G), Y (C or 



T), W (A or T), D (A, G or T) or R (A or G) [in accordance with lUPAC convention]. 
5 The underlined sequences are restriction sites. 

TBW-al Sequence Amplifications 

All amplification descriptions for TBW-al designate sequence positions with 
respect to the corresponding sequences of TBW-a2 set forth in Figure 1 . 

In a nested PGR reaction, first Primers 6 and 1 , and then Primers 6 and 5 were 

1 0 used to amplify a fiagment from nucleotides 493 to 970 (excluding 37 bases from the 
primers) of TBW-al, which was cloned and sequenced. It will be recognized, for this 
amplification and in the other amplifications from mRNA described herein, that the 
amplification substrate was produced by reverse transcription with a reverse primer, in 
this case with Primer 1 . This sequence included the unique PARVQL motif discussed 

1 5 above. This sequence is not a result of polymerase misincorporation of sequencing error 
since it was found in clones arising from separate mRNA preparations, clones were 
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sequenced on both strands, and restriction analyses of a number of clones confirmed the 
presence of a PvuII site that is dependent on the Gin eodon. 

TBW-al sequence was extended downstream by reverse transcribing mRNA with 
the poly T Adapter Primer ("AP") provided in the GIBCO-BRL 3'-RACE system. The 
5 cDN A was then amplified by PGR reactions between Primer 1 2 and AP followed by a 
nested reaction between Primer 1 1 and AP generating an amplimer from 929 - 1 157 . 
The amplimer was isolated, cloned, and sequenced. 

TBW-a2 Sequence Amplifications 

TBW mRNA was reverse transcribed with Primer 1, then PGR reactions were 
10 conducted first between Primers 6 and 1 followed by a nested reaction between Primers 6 
and 5 yielding a fragment from 493 to 970 of TBW-a2. The amplified fragment was 
cloned and sequenced. This fragment included the PARVSL motif discussed above. 

TBW-a2 sequence was extended downstream by reverse transcribing mRNA with 
AP. The cDNA was then amplified by PGR reactions between Primer 13 and AP 
1 5 followed by a nested reaction between Primer 1 6 and the Universal Anchor Primer 
("UAP") provided in the GIBGO-BRL 3'-RAGE system; generating an amplimer from 
862 - 1 154. The amplimer was isolated, cloned, and sequenced. 

TBW-a2 sequence was extended fiirther downstream by reverse transcribing 
mRNA with Primer 8. The cDNA was then amplified by PGR reactions between Primer 
20 1 1 and 8 followed by a nested reaction between Primer 1 1 and 7; generating an amplimer 
from 929 to 1462 of TBW-a2, which was cloned and sequenced. 

The translational stop signal of the TBW-a2 sequence was revealed by a 
reapplication of the 3* RAGE strategy. TBW mRNA was reverse transcribed with the AP 
provided in the GIBGO-BRL 3'-RAGE system. The cDNA was then amplified by PGR 
25 reactions between Primer 16 and AP followed by a nested reaction between Primer 34 
and AP generating an amplimer from 1407 - 1824 including 230 bp of 3' untranslated 
region (UTR). 

The translational start signal of the TBW-a2 sequence was revealed by a 5' RAGE 
strategy. TBW mRNA was reverse transcribed with Primer 14, the resulting single 
30 stranded cDNA had a homopolymeric dGn tail added to the 3' end using Terminal 
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deoxynucleotidyl Transferase (TdT) as outlined in the GIBCO BRL 5' RACE kit version 
2.0. The cDNA was then amplified by PGR reactions between Primer 15 and the 
Abridged Anchor Primer (AAP) followed by a nested reaction between Primer 1 5 and 
UAP; generating an amplimer from 1 - 543 including 103 bp of 5* untranslated region. 
5 TBW'a3 Sequence Amplifications 

Initial TBW-a3 sequence was isolated by reverse transcribing mRNA with Primer 
8. The cDNA was then amplified by PGR reactions between Primer 1 1 and 8 followed 
by a nested reaction between Primer 1 1 and 7; generating an amplimer from 805- 1314. 
The TBW-a3 sequence was extended further upstream by reverse transcribing 

1 0 mRNA with Primer 24. The cDNA was then amplified by PGR reactions between 

Primer 9 and 24 followed by a nested reaction between Primer 10 and 25; generating an 
amplimer from 282 - 8 1 7. 

The TBW-a3 sequence was extended upstream into the signal peptide by a 5* 
RAGE strategy. TBW mRNA was reverse transcribed with Primer 29, the resulting 

1 5 single stranded cDNA had a homopolymeric dG„ tail added to the 3' end using Terminal 
deoxynucleotidyl Transferase (TdT) as outlined in the GIBGO BRL 5* RAGE kit version 
2.0. The cDNA was then amplified by PGR reactions between Primer 27 and AP 
followed by a nested reaction between Primer 26 and UAP; generating an amplimer fi-om 
53 - 404. The process was then repeated, this time following the reverse transcription 

20 reaction with PGR reactions between Primer 26 and AAP followed by a nested reaction 
between Primer 31 and UAP; generating an amplimer firom 1-154. 

The translational stop signal of the TBW-a3 sequence was revealed by a 
reapplication of the 3' RAGE strategy. TBW mRNA was reverse transcribed with the AP 
provided in the GIBCO-BRL 3*-RAGE system. The cDNA was then amplified by PGR 

25 reactions between Primer 35 and AP followed by a nested reaction between Primer 35 
and UAP generating an amplimer from 1293 - 1 5 19 including 72 bp 3* untranslated 
region (UTR). 

Amplifications of mRNA derived from individual TBW larvae, and confirmatory 
restriction length analyses, confirmed that both TBW-a2 and TBW-a3 are be found in the 
30 same individual. 
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Example 2 - Expression Vectors 



Primers. The primers utilized were as follows: 



rnmer 


i9ei|ueiice 


1 1 Alls'" 

lation 


Of ipntsi* 

tion 


13(#29) 


TTCTCGAGCGATGGATTTGCACTATTTTC 


MDLQYF 


forward 


17 (#30) 


AGTCCCGGCGGCAGGCTGATA 


5'UTR 


forward 


18 (#31) 


ATGACAATTAGGCCAGACGGAATA 




reverse 


19{#32) 


CATCCGATACAAGTGGAATG 


IRYKWNE 


forward 


20{#33) 


GTCGACCCAGTGCCAATATACACGAC 


3'UTR 


reverse 


21(#34) 


GTCGACTTACCGAAACTTGATGGATG 


3'UTR 


reverse 


24 (#35) 


TTGCGGCCGCCATATATCCCACAG 




reverse 


28(#36) 


GTGAAATACAATTCGTTCGGTCTA 


EIQFVRS 


forward 


30(#37) 


GCAGATGTGGAAAATAAGTGGATT 


3'UTR 


reverse 


32 ( #38 ) 


AGCGAA I ACLA 1 (jACbAAACA 




re ve r b e 


33(#39) 


GCGGCCGCCTTCCTAC 


AAAFLP 


forward 


36(#40) 


ACCATCCATTTAGACGACGC 


5' UTR 


forward 


37(#41) 


GTAACACCAACTTCCACCG 




reverse 


38(#42) 


GAATGGCCAACATGTCGCTGGAAATC 


MSLEI 


forward 


39(#43) 


AATAATGACGTCACCGAACATCCCTCCCC 
CACCG 




reverse 



TBW'a2 

5 Two overlapping pieces of TBW-a2 were amplified, and a unique Ncol site was 

used to ligate the two pieces into a complete TBW-a2 sequence. The 3' piece was 
created by using Primer 1 8 to prime reverse transcription (beginning at base 879), 
followed by PGR with Primers 17 and 18. The 5' piece was created by using Primer 20 
to prime reverse transcription (beginning at base 1764), followed by nested PGR with, 

1 0 first. Primers 1 3 and 20, then Primers 1 9 and 2 1 , The amplimers were cut with Ncol, 

polished, phosphorylated and ligated into a blunt-ended cloning site of a first vector. The 
insert from the first vector is used to provide an insert for one or more expression 
vectors. Or a vector with a T7 or Sp6 promoter is used, and RNA created from the vector 
is expressed by injection into a cell such as Xenopus oocyte. 
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Examole 2 - Expression Vectors 



Primers. The primers utilized were as follows: 



Primer 


9ei|ueiice 


lation 


tion 


13(#29) 


TTCTCGAGCGATGGATTTGCACTATTTTC 


MDLQYF 


forward 


17 (#30) 


AGTCCCGGCGGCAGGCTGATA 


5'UTR 


forward 


18 (#31) 


ATGACAATTAGGCCAGACGGAATA 




reverse 


19(#32) 


CATCCGATACAAGTGGAATG 


IRYKWNE 


forward 


20(#33) 


GTCGACCCAGTGCCAATATACACGAC 


3'UTR 


reverse 


21(#34) 


GTCGACTTACCGAAACTTGATGGATG 


3'UTR 


reverse 


24 (#35) 


TTGCGGCCGCCATATATCCCACAG 




reverse 


28 (#36) 


GTGAAATACAATTCGTTCGGTCTA 


EIQFVRS 


forward 


30(#37) 


GCAGATGTGGAAAATAAGTGGATT 


3'UTR 


reverse 


32 (#38 ) 


TV TV TV m TV 7\ m /~* 7v tv 7\ tv TV 

AGCGAATACCATGACGAAACA 




reverse 


33 (#39) 


GCGGCCGCCTTCCTAC 


AAAFLP 


forward 


36(#40) 


ACCATCCATTTAGACGACGC 


5' UTR 


forward 


37 (#41) 


GTAACACCAACTTCCACCG 




reverse 


38 (#42) 


GAATGGCCAACATGTCGCTGGAAATC 


MSLEI 


forward 


39(#43) 


AATAATGACGTCACCGAACATCCCTCCCC 
CACCG 




reverse 



5 Two overlapping pieces of TBW-a2 were amplified, and a unique Ncol site was 

used to ligate the two pieces into a complete TBW-a2 sequence. The 3' piece was 
created by using Primer 1 8 to prime reverse transcription (beginning at base 879), 
followed by PGR with Primers 17 and 18. The 5' piece was created by using Primer 20 
to prime reverse transcription (beginning at base 1764), followed by nested PGR with, 

1 0 first, Primers 1 3 and 20, then Primers 1 9 and 2 1 . The amplimers were cut with Ncol, 

polished, phosphorylated and ligated into a blunt-ended cloning site of a first vector. The 
insert from the first vector is used to provide an insert for one or more expression 
vectors. Or a vector with a T7 or Sp6 promoter is used, and RN A created from the vector 
is expressed by injection into a cell such as Xenopus oocyte. 



wo 98/49185 




PCT/US98/08563 



-24- 



TBW-aS 

For the a3 sequence the same strategy was followed in order to generate a vector 
that serves as a depository for the sequence. In order to add a translational start site, the 
first step was to construct a chimera by adding the Aedes aegypti signal peptide from its 
5 rdl gene to the 5' end of the TBW GAB A a3. The Aedes signal peptide was prepared in a 
two-step approach whereby it was first isolated using primers that exactly matched its 
sequence, and then was modified using highly engineered primers. Mosquito larvae were 
grown in-house and extracted for mRNA in the same manner as had been done for the 
TBW larvae heads. The signal peptide sequence was then reverse transcribed with 

10 Primer 37 and the cDNA amplified by PGR between Primer 36 and Primer 37 yielding a 
fragment from bp 556 - 909 (using the numbering of GenBank accession M69057). The 
amplimer was polished, phosphorylated and ligated into Smal cut pUC18 and fully 
sequenced and found to be in complete agreement with the GenBank deposit. This 
purified plasmid was then used as a template in a PGR reaction between Primer 38 and 

1 5 Primer 39 which amplified and modified the sequence fi-om 670 - 842 (bp 696 - 808 
lying between the primers) of the Aedes signal peptide; the bp 681 - 683 code for the 
ATG Met start signal. This reaction introduced an MscI site at the 5' end, and an Aatll 
site at the 3* end and changed bp 81 5 to 842 fi^om Aedes to TBW sequence. This 
engineered fi-agment was then ligated to an endogenous Aatll site in the TBW-a3 signal 

20 peptide (between bp 101 and 102) by digesting a TBW-a3 5* piece (a Primer 28 and 30 
amplimer, created by nested reaction after an initial amplification with Primers 33 and 
24) with Aatll. This chimeric fragment was then combined with the 3' piece (the Primer 
32 and 33 amplimer) by use of the unique Xhol site. 



SEQUENCE LISTING 

(1) GENERAL INFORMATION 
(i) APPLICANT: Hailing et al. 

(ii) TITLE OF THE INVENTION: Lepidopteran GABA-Gated Chloride 

Channels 

(iii) NUMBER OF SEQUENCES: 43 
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(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Dechert Price & Rhoad.s 

(B) STREET: PC Box 5218 

(C) CITY: Princeton 

(D) STATE: NJ 

(E) COUNTRY: USA 

(F) ZIP: 08543 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Allen Bloom 

(B) REGISTRATION NUMBER: 29,135 

(C) REFERENCE/DOCKET NUMBER: FMC 102 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 609-520-3214 

(B) TELEFAX: 609-520-3259 

(C) TELEX: 

(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1844 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 104... 1591 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

CTTGACGCCT GAGGGNCTGT AAGAACACGC CAGTCCCGCC GGCAGGCTGA TACGCGGCTG 60 
CCGGCAGCCA GCGTCCGCAA GGGCGCACGC GGACCTGCAA AAC ATG CAT ACG AGC 115 

Met His Thr Ser 
1 

CGT CCG CGC GGC GTG CAC AGC ATC GCG CTA GTG CTG TCT CTC GCG ATT 163 
Arg Pro Arg Gly Val His Ser He Ala Leu Val Leu Ser Leu Ala He 
5 10 15 20 

GCC TGG TTA CCT CAT GCT GAC CAT GCC GCG GGA GCG GGA GGA GGG GGG 211 
Ala Trp Leu Pro His Ala Asp His Ala Ala Gly Ala Gly Gly Gly Gly 
25 30 35 
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ATG TTT GGT GAC GTC AAT ATC TCA GCC ATT TTG GAT TCG CTA AGT GTA 
Met Phe Gly Asp Val Asn He Ser Ala He Leu Asp Ser Leu Ser Val 
40 45 50 



259 



AGC TAG GAC AAA AGA GTG AGG CCG AAC TAT GGA 
Ser Tyr Asp Lys Arg Val Arg Pro Asn Tyr Gly 
55 60 

GTG GGA GTC ACC ATG TAC GTG CTC TCC ATC AGC 
Val Gly Val Thr Met Tyr Val Leu Ser He Ser 
70 75 

AAA ATG GAT TTC ACC CTG GAT TTC TAC TTC AGA 
Lys Met Asp Phe Thr Leu Asp Phe Tyr Phe Arg 
85 90 95 

CCC AGG CTT GCT TAC AAA AAA AGG ACG GGT GTG 
Pro Arg Leu Ala Tyr Lys Lys Arg Thr Gly Val 

105 110 



GGA CCG CCA 
Gly Pro Pro 
65 

TCC TTA TCT 
Ser Leu Ser 
80 

CAA TTT TGG 
Gin Phe Trp 



GAG ACT CTG 
Glu Thr Leu 



GTG GAT 
Val Asp 



GAA GTG 
Glu Val 



ACA GAC 
Thr Asp 
100 

TCC GTC 
Ser Val 
115 



307 



355 



403 



451 



GGC TCG GAA 
Gly Ser Glu 



AAC GAA AAA 
Asn Glu Lys 
135 



TTT ATT AGA AAC ATA TGG GTA CCC 
Phe He Arg Asn He Trp Val Pro 
120 125 

CAG TCT TAT TTC CAC ATA GCT ACT 
Gin Ser Tyr Phe His He Ala Thr 
140 



ATA CGC ATT 
He Arg He 
150 

ATC ACC GCT 
He Thr Ala 
165 

CAA TTA TGC 
Gin Leu Cys 



ATC CGA TAC 
He Arg Tyr 



GAA GTG TCT 
Glu Val Ser 
215 

ATG GAG ATT 
Met Glu He 
230 

ATT CAA TTT 
He Gin Phe 
245 

CCG TCT GGC 
Pro Ser Gly 



CAT CAT TCT GGA TCT ATT ACT AGG 
His His Ser Gly Ser He Thr Arg 
155 

TCT TGT CCG ATG GAT TTG CAG TAT 
Ser Cys Pro Met Asp Leu Gin Tyr 
170 175 

AAT ATT GAA ATC GAA AGT TTT GGC 
Asn He Glu He Glu Ser Phe Gly 
185 190 



AAG TGG 
Lys Trp 
200 

TTG CCG 
Leu Pro 



TCT CTT 
Ser Leu 



GTA CGC 
Val Arg 



CTA ATT 
Leu He 
265 



AAT GAG 
Asn Glu 



CAA TTC 
Gin Phe 



ACG ACA 
Thr Thr 
235 

TCG ATG 
Ser Met 
250 

GTC ATT 
Val He 



GGG CCC AAC TCA 
Gly Pro Asn Ser 
205 

AAG GTG CTG GGC 
Lys Val Leu Gly 
220 

GGA AAC TAC TCT 
Gly Asn Tyr Ser 



GGA TAC TAT TTA 
Gly Tyr Tyr Leu 
255 

ATA TCT TGG GTA 
He Ser Trp Val 
270 



GAC ACC TTC 
Asp Thr Phe 
130 

ACA AGC AAC 
Thr Ser Asn 
145 

AGT ATA AGA 
Ser He Arg 
160 

TTT CCG ATG 
Phe Pro Met 



TAC ACC ATG 
Tyr Thr Met 



GTG GGT GTG 
Val Gly Val 
210 

CAC CGG CAG 
His Arg Gin 
225 

CGT CTG GCA 
Arg Leu Ala 
240 

ATT CAG ATT 
He Gin He 



TTT GTT 
Phe Val 



GAA TTC 
Glu Phe 



TCA TTT TGG 
Ser Phe Trp 



CTG ACT 
Leu Thr 



GAC CGT 
Asp Arg 
180 

CGG GAC 
Arg Asp 
195 

TCG AGC 
Ser Ser 



CGG GCC 
Arg Ala 



TGT GAA 
Cys Glu 



TAT ATT 
Tyr He 
260 

TTG AAT 
Leu Asn 
275 



499 



547 



595 



643 



691 



739 



787 



835 



883 



931 



CGA AAT GCG ACA CCT GCA AGG GTA TCA CTA GGT GTC ACA ACT GTA TTG 



97 9 
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Arg Asn Ala Thr Pro Ala Arg Val Ser Leu Gly Val Thr Thr Val Leu 
280 285 290 

ACG ATG ACQ ACQ CTC ATG TCG TCC ACG AAT GCG GCT CTG CCC AAG ATC 1027 
Thr Met Thr Thr Leu Met Ser Ser Thr Asn Ala Ala Leu Pro Lys lie 
295 300 305 

TCA TAT GTC AAG TCC ATC GAT GTC TAT CTG GGA ACT TGT TTC GTC ATG 1075 
Ser Tyr Val Lys Ser lie Asp Val Tyr Leu Gly Thr Cys Phe Val Met 
310 315 320 

GTC TTC GCC AGT TTA CTA GAA TAT GCC ACG GTT GGC TAT ATG GCT AAA 1123 
Val Phe Ala Ser Leu Leu Glu Tyr Ala Thr Val Gly Tyr Met Ala Lys 
325 330 335 340 

AGG ATA CAG ATG AGG AAA CAA AGA TTC ACT GCT GTT CAA AAA ATG GCC 1171 
Arg He Gin Met Arg Lys Gin Arg Phe Thr Ala Val Gin Lys Met Ala 
345 350 355 

GCC GAG AAG A7VA ATG CAA ATA GAT GGT CCT CCA GGG TCA GCT GAG CCT 1219 
Ala Glu Lys Lys Met Gin He Asp Gly Pro Pro Gly Ser Ala Glu Pro 
360 365 370 

ATC CCC CCA CCG AGG ACC AGC ACC CTA TCT AGG CCA CCA CCA CCT AGC 1267 
He Pro Pro Pro Arg Thr Ser Thr Leu Ser Arg Pro Pro Pro Pro Ser 
375 380 385 

CGA TTA TCG GAG GTT CGG 'TTC AAA GTT CAC GAT CCG AAG GCA TAT TCT 1315 
Arg Leu Ser Glu Val Arg Phe Lys Val His Asp Pro Lys Ala Tyr Ser 
390 395 400 

AAA GGC GGT ACT TTA GAA AAC ACT ATC AAT GGG GCT CGG GGC CCA GCC 1363 
Lys Gly Gly Thr Leu Glu Asn Thr He Asn Gly Ala Arg Gly Pro Ala 
405 410 415 420 

CCA GGA CCT GCT CCA CCG GCA GAC GAA GAA GCT GGA CCA CCT CCG CAT 1411 
Pro Gly Pro Ala Pro Pro Ala Asp Glu Glu Ala Gly Pro Pro Pro His 
425 430 435 

CTC GTT CAT GCT TCC AAG GGT ATC AAC AAA CTG CTC GGC ACG ACC CCC 1459 
Leu Val His Ala Ser Lys Gly He Asn Lys Leu Leu Gly Thr Thr Pro 
440 445 450 

TCG GAC ATC GAC AAG TAC TCG CGC ATC GTG TTC CCC GTC TGC TTC GTT 1507 
Ser Asp He Asp Lys Tyr Ser Arg He Val Phe Pro Val Cys Phe Val 
455 460 465 

TGC TTT AAC CTT ATG TAC TGG ATC ATT TAC CTT CAC GTG TCT GAC GTC 1555 
Cys Phe Asn Leu Met Tyr Trp He He Tyr Leu His Val Ser Asp Val 
470 475 480 

GTG GCT GAT GAC TTG GTA CTA CTA GGC GAA GAA AAT TGAATTCTCT TTAACT 1607 
Val Ala Asp Asp Leu Val Leu Leu Gly Glu Glu Asn 

485 490 495 

ATACCGGACT TGTTTTAACT TAGGGTGCTT ATGATCAACC ATCCATCAAG TTTCGGTAAA 1667 

GTTCTTTAAA TCCTAGAAAC GCTCAGTAAA ATAATAGCGT TCTTTGTGTT TATAAATATA 1727 

ATTATAGTAC AGATCACTAT GTTTATTATA GATAAGTGTC GTGTATATTG GCACTGGTAA 1787 

TATTAATTCT TTAGAAAATA AAGATAATAT GAAGTTC7\AA AAAAAAAAAA AAAAAAA 184 4 



(2) INFORMATION FOR SEQ ID NO: 2: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 96 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(V) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



Met 


His 


Thr 


Ser 


Arg 


Pro 


Arg 


Gly 


Val 


His 


Ser 


He 


Ala 


Leu 


Val 


Leu 


1 








5 








10 










15 




Ser 


Leu 


Ala 


He 


Ala 


Trp 


Leu 


Pro 


His 


Ala 


Asp 


His 


Ala 


Ala 


Gly 


Ala 








20 










25 










30 






Gly 


Gly 


Gly 


Gly 


Met 


Phe 


Gly 


Asp 


Val 


Asn 


He 


Ser 


Ala 


He 


Leu 


Asp 






35 










40 










45 








Ser 


Leu 


Ser 


Val 


Ser 


Tyr 


Asp 


Lys 


Arg 


Val 


Arg 


Pro 


Asn 


Tyr 


Gly 


Gly 




50 










55 










60 










Pro 


Pro 


Val 


Asp 


Val 


Gly 


Val 


Thr 


Met 


Tyr 


Val 


Leu 


Ser 


He 


Ser 


Ser 


65 










70 










75 










80 


Leu 


Ser 


Glu 


Val 


Lys 


Met 


Asp 


Phe 


Thr 


Leu 


Asp 


Phe 


Tyr 


Phe 


Arg 


Gin 










85 










90 










95 




Phe 


Trp 


Thr 


Asp 


Pro 


Arg 


Leu 


Ala 


Tyr 


Lys 


Lys 


Arg 


Thr 


Gly 


Val 


Glu 








100 










105 










110 






Thr 


Leu 


Ser 


Val 


Gly 


Ser 


Glu 


Phe 


He 


Arg 


Asn 


He 


Trp 


Val 


Pro 


Asp 






115 










120 










125 








Thr 


Phe 


Phe 


Val 


Asn 


Glu 


Lys 


Gin 


Ser 


Tyr 


Phe 


His 


He 


Ala 


Thr 


Thr 




130 










135 










140 










Ser 


Asn 


Glu 


Phe 


He 


Arg 


He 


His 


His 


Ser 


Gly 


Ser 


He 


Thr 


Arg 


Ser 


145 










150 










155 










160 


He 


Arg 


Leu 


Thr 


He 


Thr 


Ala 


Ser 


Cys 


Pro 


Met 


Asp 


Leu 


Gin 


Tyr 


Phe 










165 










170 










175 




Pro 


Met 


Asp 


Arg 


Gin 


Leu 


Cys 


Asn 


He 


Glu 


He 


Glu 


Ser 


Phe 


Gly 


Tyr 








180 










185 










190 






Thr 


Met 


Arg 


Asp 


He 


Arg 


Tyr 


Lys 


Trp 


Asn 


Glu 


Gly 


Pro 


Asn 


Ser 


Val 






195 










200 










205 








Gly 


Val 


Ser 


Ser 


Glu 


Val 


Ser 


Leu 


Pro 


Gin 


Phe 


Lys 


Val 


Leu 


Gly 


His 


210 










215 










220 










Arg 


Gin 


Arg 


Ala 


Met 


Glu 


He 


Ser 


Leu 


Thr 


Thr 


Gly 


Asn 


Tyr 


Ser 


Arg 


225 










230 










235 










240 


Leu 


Ala 


Cys 


Glu 


He 


Gin 


Phe 


Val 


Arg 


Ser 


Met 


Gly 


Tyr 


Tyr 


Leu 


He 










245 










250 










255 




Gin 


lie 


Tyr 


lie 


Pro 


Ser 




Leu 


Tic* 

lie 


vai 


lie 


T 1 *a 

lie 


Ser 


irp 


vai 


Ser 






260 










265 










270 






Phe 


Trp 


Leu 


Asn 


Arg 


Asn 


Ala 


Thr 


Pro 


Ala 


Arg 


Val 


Ser 


Leu 


Gly 


Val 




275 










280 










285 








Thr 


Thr 


Val 


Leu 


Thr 


Met 


Thr 


Thr 


Leu 


Met 


Ser 


Ser 


Thr 


Asn 


Ala 


Ala 




290 










295 










300 










Leu 


Pro 


Lys 


He 


Ser 


Tyr 


Val 


Lys 


Ser 


He 


Asp 


Val 


Tyr 


Leu 


Gly 


Thr 


305 










310 










315 










320 


Cys 


Phe 


Val 


Met 


Val 


Phe 


Ala 


Ser 


Leu 


Leu 


Glu 


Tyr 


Ala 


Thr 


Val 


Gly 








325 










330 










335 




Tyr 


Met 


Ala 


Lys 


Arg 


He 


Gin 


Met 


Arg 


Lys 


Gin 


Arg 


Phe 


Thr 


Ala 


Val 








340 










345 










350 






Gin 


Lys 


Met 


Ala 


Ala 


Glu 


Lys 


Lys 


Met 


Gin 


He 


Asp 


Gly 


Pro 


Pro 


Gly 






355 










360 










365 








Ser 


Ala 


Glu 


Pro 


He 


Pro 


Pro 


Pro 


Arg 


Thr 


Ser 


Thr 


Leu 


Ser 


Arg 


Pro 




370 










375 










380 










Pro 


Pro 


Pro 


Ser 


Arg 


Leu 


Ser 


Glu 


Val 


Arg 


Phe 


Lys 


Val 


His 


Asp 


Pro 


385 










390 










395 










400 


Lys 


Ala 


Tyr 


Ser 


Lys 


Gly 


Gly 


Thr 


Leu 


Glu 


Asn 


Thr 


He 


Asn 


Gly 


Ala 
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405 410 415 



Arg 


Gly 


Pro 


TV 1 

Ala 


Pro 


Gly 


Pro 


Ala 


Pro 


Pro 


Ala 


Asp 


Glu 


Glu 


Ala 


Gly 








420 










425 










430 






Pro 


Pro 


Pro 


His 


Leu 


Val 


His 


Ala 


Ser 


Lys 


Gly 


He 


Asn 


Lys 


Leu 


Leu 






435 










440 










445 








Gly 


Thr 


Thr 


Pro 


Ser 


Asp 


He 


Asp 


Lys 


Tyr 


Ser Arg 


He 


Val 


Phe 


Pro 




450 










455 










4 60 










Val 


Cys 


Phe 


Val 


Cys 


Phe 


Asn 


Leu 


Met 


Tyr 


Trp 


He 


He 


Tyr 


Leu 


His 


465 










470 










475 










480 


Val 


Ser Asp 


Val 


Val 


Ala 


Asp Asp 


Leu 


Val 


Leu 


Leu 


Gly Glu Glu 


Asn 



485 490 495 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 67 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



Ala 


Gly 


Ala 


Gly 


Gly 


Gly 


Gly 


Met 


Phe 


Gly 


Asp 


Val 


Asn 


He 


Ser 


Ala 


1 








5 










10 










15 




He 


Leu 


Asp 


Ser 


Leu 


Ser 


Val 


Ser 


Tyr 


Asp 


Lys 


Arg 


Val 


Arg 


Pro 


Asn 








20 










25 










30 






Tyr 


Gly 


Gly 


Pro 


Pro 


Val 


Asp 


Val 


Gly 


Val 


Thr 


Met 


Tyr 


Val 


Leu 


Ser 






35 










40 










45 








He 


Ser 


Ser 


Leu 


Ser 


Glu 


Val 


Lys 


Met 


Asp 


Phe 


Thr 


Leu 


Asp 


Phe 


Tyr 




50 










55 










60 








rne 


Arg 


bin 


rne 


Trp 


rp V. -w- 

1 nr 


TV _ _ 

Asp 


Pro Arg 


Leu 


Aia 


Tyr 


Lys 


Lys 


Arg 


Thr 


65 










70 










75 










80 


Gly 


Val 


Glu 


Thr 


Leu 


Ser 


Val 


Gly 


Ser 


Glu 


Phe 


He 


Arg 


Asn 


He 


Trp 










85 










90 










95 




Val 


Pro 


Asp 


Thr 


Phe 


Phe 


Val 


Asn 


Glu 


Lys 


Gin 


Ser 


Tyr 


Phe 


His 


He 








100 










105 










110 






Ala 


Thr 


Thr 


Ser 


Asn 


Glu 


Phe 


He Arg 


He 


His 


His 


Ser 


Gly Ser 


He 






115 










120 










125 








Thr 


Arg 


Ser 


He 


Arg 


Leu 


Thr 


He 


Thr 


Ala 


Ser 


Cys 


Pro 


Met 


Asp 


Leu 




130 










135 










140 










Gin 


Tyr 


Phe 


Pro 


Met 


Asp 


Arg 


Gin 


Leu 


Cys 


Asn 


He 


Glu 


He 


Glu 


Ser 


145 










150 










155 










160 


Phe 


Gly 


Tyr 


Thr 


Met 


Arg 


Asp 


He 


Arg 


Tyr 


Lys 


Trp 


Asn 


Glu 


Gly 


Pro 










165 










170 










175 




Asn 


Ser 


Val 


Gly 


Val 


Ser 


Ser 


Glu 


Val 


Ser 


Leu 


Pro 


Gin 


Phe 


Lys 


Val 








180 










185 










190 




Leu 


Gly 


His 


Arg 


Gin 


Arg 


Ala 


Met 


Glu 


He 


Ser 


Leu 


Thr 


Thr Gly 


Asn 






195 










200 










205 








Tyr 


Ser 


Arg 


Leu 


Ala 


Cys 


Glu 


He 


Gin 


Phe 


Val 


Arg 


Ser Met Gly 


Tyr 




210 










215 










220 










Tyr 


Leu 


He 


Gin 


He 


Tyr 


He 


Pro 


Ser 


Gly 


Leu 


He 


Val 


He 


He 


Ser 


225 










230 










235 










240 


Trp 


Val 


Ser 


Phe 


Trp 


Leu 


Asn 


Arg 


Asn 


Ala 


Thr 


Pro 


Ala 


Arg 


Val 


Ser 










245 










250 










255 




Leu 


Gly 


Val 


Thr 


Thr 


Val 


Leu 


Thr 


Met 


Thr 


Thr 


Leu 


Met 


Ser 


Ser 


Thr 








260 










265 










270 






Asn 


Ala 


Ala 


Leu 


Pro 


Lys 


He 


Ser 


Tyr 


Val 


Lys 


Ser 


He 


Asp Val 


Tyr 






275 










280 










285 








Leu 


Gly 


Thr 


Cys 


Phe 


Val 


Met 


Val 


Phe 


Ala 


Ser 


Leu 


Leu 


Glu 


Tyr 


Ala 




290 










295 










300 










Thr 


Val 


Gly 


Tyr 


Met 


Ala 


Lys 


Arg 


He 


Gin 


Met 


Arg 


Lys 


Gin Arg 


Phe 
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305 














Thr 


Ala 


Val 


Gin Lys 


Met 


Ala 


Ala 
















Pro 


Pro 


Gly 


Ser Ala 


olU 


Pro 


He 








340 








Ser 


Arg 


Pro 


Pro Pro 


Pro 


Ser Arg 




'3 t; 








OCA 


His 


Asp 


Pro 


Lys Ala 


Tyr 


Ser 


Lys 




o T 

3/0 








375 




Asn 


Gly 


Ala 


Arg Gly 


Pro 


Ala 


Pro 


385 








390 






Glu 


TV T „ 

Ala 


Gly 


Pro Pro 


Pro 


His 


Leu 
















Lys 


Leu 


Leu 


Gly Thr 


Thr 


Pro 


Ser 






420 








Val 


Phe 


Pro 


Val Cys 


Phe 


Val 


Cys 






435 








440 


Tyr 


Leu 


His 


Val Ser 


Asp Val 


Val 


450 








455 




Glu 


Glu 


Asn 










465 















315 320 



Glu 


Lys 


Lys 


Met 




T 1 o 

1 xe 


Asp 






330 














Pro 


Pro 


Pro Arg 


i nr 






Leu 


345 
















Leu 


Ser 


Glu 


vai 


A ■yry 


IT lie? 




Val 

V CIJ. 


















Gly Gly 


Thr 


Leu 


PI 11 
Ij-LU 


Asn 


Thr* 


T 1 o 

ij.e 








380 










Gly 


Pro 


Ala 


Pro 


Pro 


Ala 


Asp 


Glu 






395 










400 


Val 


His 


Ala 


Ser 


Lys 


Gly 


He 


Asn 




410 










415 




Asp 


He 


Asp 


Lys 


Tyr 


Ser 


Arg 


He 


425 










430 






Phe 


Asn 


Leu 


Met 


Tyr 


Trp 


He 


He 










445 








Ala 


Asp Asp 


Leu 


Val 


Leu 


Leu 


Gly 



4 60 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1519 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1. . . 1443 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



CGC CCC CGC TCC GCG CCG CTG CTG CTG GCG CTC GCG GCC GCC TTC CTA 48 
Arg Pro Arg Ser Ala Pro Leu Leu Leu Ala Leu Ala Ala Ala Phe Leu 
15 10 15 

CCG CAA GCC AAC CAT GTC GCG GGT GCC GGT GGG GGA GGG ATG TTC GGT 96 
Pro Gin Ala Asn His Val Ala Gly Ala Gly Gly Gly Gly Met Phe Gly 
20 25 30 

GAC GTC AAT ATA TCA GCC ATT TTG GAT TCA TTT AGT ATA AGT TAC GAC 144 
Asp Val Asn He Ser Ala He Leu Asp Ser Phe Ser He Ser Tyr Asp 
35 40 45 

AAA AGA GTA AGA CCA AAC TAT GGA GGT CCG CCA GTG GAG GTG GGC GTC 192 
Lys Arg Val Arg Pro Asn Tyr Gly Gly Pro Pro Val Glu Val Gly Val 
50 55 60 

ACC ATG TAT GTG CTC TCT ATC AGC TCC GTC TCC GAA GTG CTC ATG GAT 240 
Thr Met Tyr Val Leu Ser He Ser Ser Val Ser Glu Val Leu Met Asp 
65 70 75 80 

TTC ACA TTG GAC TTT TAC TTC AGA CAA TTT TGG ACT GAT CCT CGA TTA 288 
Phe Thr Leu Asp Phe Tyr Phe Arg Gin Phe Trp Thr Asp Pro Arg Leu 
85 90 95 
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GCA TAG AAA AAA AGA ACC GGA GTT GAA ACT TTA TCT GTG GGC TCA GAA 
Ala Tyr Lys Lys Arg Thr Gly Val Glu Thr Leu Ser Val Gly Ser Glu 

100 105 110 



336 



TTC ATA AAG AAC ATA TGG GTA CCC GAG AGG TTC TTT GTA AAT GAA AAG 384 
Phe He Lys Asn He Trp Val Pro Asp Thr Phe Phe Val Asn Glu Lys 
115 120 125 

CAA TCT TAT TTC CAT ATA GCA ACA ACC AGG AAT GAA TTC ATC GGT ATA 432 
Gin Ser Tyr Phe His He Ala Thr Thr Ser Asn Glu Phe He Arg He 
130 135 140 

GAG TAT TCT GGC TCT ATC ACT AGA AGT ATC AGA TTG AGG ATC ACA GCC 480 
His Tyr Ser Gly Ser He Thr Arg Ser He Arg Leu Thr He Thr Ala 
145 150 155 160 

TCT TGG CCG ATG AAT TTG CAA TAG TTC GCG ATG GAT GGA CAG TTG TGG 528 
Ser Cys Pro Met Asn Leu Gin Tyr Phe Pro Met Asp Arg Gin Leu Cys 
165 170 175 

GAG ATA GAA ATT GAA AGT TTG GGC TAG ACC ATG GGG GAG ATG AGA TAG 57 6 

His He Glu He Glu Ser Phe Gly Tyr Thr Met Arg Asp He Arg Tyr 
180 185 190 

AAA TGG AAC GAA GGG CCG AAC TCT GTG GGT GTT TGG AGG GAA GTG TGG 624 
Lys Trp Asn Glu Gly Pro Asn Ser Val Gly Val Ser Ser Glu Val Ser 
195 200 205 

CTG CCG CAG TTC AAG GTG GTG GGT GAT GGC GAA GGA GGT ATG GAG ATG 672 
Leu Pro Gin Phe Lys Val Leu Gly His Arg Gin Arg Ala Met Glu He 
210 215 220 

TGG GTT AGT AGA GGA AAT TAT TCA GGG TTG GGA TGT GAA ATA GAA TTC 720 
Ser Leu Thr Thr Gly Asn Tyr Ser Arg Leu Ala Cys Glu He Gin Phe 
225 230 235 240 

GTT GGG TGT ATG GGA TAT TAG TTA ATC CAA ATT TAT ATT CCC TGT GGT 768 
Val Arg Ser Met Gly Tyr Tyr Leu He Gin He Tyr He Pro Ser Gly 

245 250 255 



TTG ATT GTG ATC ATA TCA TGG GTA TCA TTT TGG TTG AAT GGA AAT GGG 816 
Leu He Val He He Ser Trp Val Ser Phe Trp Leu Asn Arg Asn Ala 
260 265 270 

ACA CCA GGT CGA GTG GCC GTA GGT GTT AGG ACT GTA TTG AGA ATG ACA 864 
Thr Pro Ala Arg Val Ala Leu Gly Val Thr Thr Val Leu Thr Met Thr 
275 280 285 

AGG GTT ATG TGG TCT AGT AAG GCG GGG CTG GCC AAG ATG TCA TAG GTG 912 
Thr Leu Met Ser Ser Thr Asn Ala Ala Leu Pro Lys He Ser Tyr Val 
290 295 300 

AAA TGG ATA GAT GTA TAT GTG GGG ACA TGT TTC GTC ATG GTA TTC GGT 960 
Lys Ser He Asp Val Tyr Leu Gly Thr Cys Phe Val Met Val Phe Ala 
305 310 315 320 

AGT GTA CTA GAA TAG GCG ACT GTG GGA TAT ATG GCA AAG AGA ATA GAG 1008 
Ser Leu Leu Glu Tyr Ala Thr Val Gly Tyr Met Ala Lys Arg He Gin 
325 330 335 

ATG AGA AAA CAA AGA TTT GTG GGG ATA CAG AAA ATA GGT TCT GAA AAG 1056 
Met Arg Lys Gin Arg Phe Val Ala He Gin Lys He Ala Ser Glu Lys 
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340 345 350 

AAA ATC CCC GTT GAC TGC CCA CCC GTA GGC GAT CCA CAT ACT TTA TCG 1104 
Lys He Pro Val Asp Cys Pro Pro Val Gly Asp Pro His Thr Leu Ser 
355 360 365 

AAG ATG GGA ACA CTT GGC AGA TGC CCA CCC GGT AGA CCA TCG GAG GTG 1152 
Lys Met Gly Thr Leu Gly Arg Cys Pro Pro Gly Arg Pro Ser Glu Val 
370 375 380 

CGG TTC AAA GTG CAT GAC CCA AAA GCG CAT TCC AAA GGC GGG ACG TTA 1200 
Arg Phe Lys Val His Asp Pro Lys Ala His Ser Lys Gly Gly Thr Leu 
385 390 395 400 

GAG AAC ACT ATT AAT GGA GGT CGC AGT GGA GCA GAA GAA GAA AAC CCA 124 8 
Glu Asn Thr He Asn Gly Gly Arg Ser Gly Ala Glu Glu Glu Asn Pro 
405 410 415 

GGC CCG CCC CCA CAC ATT TTA CAT CCC GGC AAG GAC ATA AGC AAA CTG 1296 
Gly Pro Pro Pro His He Leu His Pro Gly Lys Asp He Ser Lys Leu 
420 425 430 

CTC GGC ATG ACT CCC TCG GAC ATC GAC AAG TAG TCG CGC ATC GTG TTC 134 4 
Leu Gly Met Thr Pro Ser Asp He Asp Lys Tyr Ser Arg He Val Phe 
435 440 445 

CCC GTC TGC TTC GTT TGC TTT AAC CTT ATG TAG TGG ATC ATT TAG CTT 1392 
Pro Val Cys Phe Val' Cys Phe Asn Leu Met Tyr Trp He He Tyr Leu 
450 455 460 

CAC GTG TCT GAC GTC GTG GCT GAC GAT CTG GTT CTA CTG GAA GAG GAT 144 0 
His Val Ser Asp Val Val Ala Asp Asp Leu Val Leu Leu Glu Glu Asp 
465 470 475 480 

PiPJi TAGAGGGCGC AGTACATAAT CCACTTATTT TCCACAWCTG CAAGCTAAAT AAT AAT 1499 
Lys 



TTGAAACGGA TAAAACTTTA 1519 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 81 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Arg Pro Arg Ser Ala Pro Leu Leu Leu Ala Leu Ala Ala Ala Phe Leu 

15 10 15 

Pro Gin Ala Asn His Val Ala Gly Ala Gly Gly Gly Gly Met Phe Gly 

20 25 30 

Asp Val Asn He Ser Ala He Leu Asp Ser Phe Ser He Ser Tyr Asp 

35 40 45 

Lys Arg Val Arg Pro Asn Tyr Gly Gly Pro Pro Val Glu Val Gly Val 

50 55 60 

Thr Met Tyr Val Leu Ser He Ser Ser Val Ser Glu Val Leu Met Asp 
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65 






70 










75 










oU 


Phe Thr 


Leu 


Asp Phe 


Tyr 


Phe 


Arg 


Gin 


rne 


Trp Thr 


Asp 


Pro Arg 


Leu 






Q t; 
O J 








on 










35 




Ala Tyr 


Lys 


Lys Arg 

iUU 


Thr 


Gly 


Val 


Glu 
105 


i nr 


T All 

Leu 


Ser 


vai 


biy 

IIV 


Ser 


blU 


Phe lie 


Lys 


Asn lie 


Trp Val 


Pro Asp 


1 nr 


Phe 


Phe 


vai 


Asn 


Glu 


Lys 




lit; 








120 










± z o 








Gin Ser 


Tyr 


Irne His 


lie 


Ala 


Thr 


Thr 


C r-i V- 

oer 


Asn 


Glu 


rne 


ixe 


Arg 


lie 


130 








135 










140 










His Tyr 


Ser 


Gly Ser 


lie 


Thr 


Arg 


Ser 


X J.e 


Arg 


Leu 


1 nr 


lie 


Thr 


i\ia 


1 >l c 

145 






150 










155 










1 OU 


Ser Cys 


Pro 


Met Asn 


Leu 


Gin 


Tyr 


Phe 


Pro 


Met 


Asp 


Arg 


Gin 


Leu 


Cys 




1 Cti 
lOD 










1 Tfl 

JL / u 










175 




His lie 


Glu 


lie Glu 


Ser 


Phe 


Gly Tyr 


rp Vs 

i nr 


Met Arg 


Asp 


He 


Arg 


Tyr 






180 








185 










190 






Lys Trp 


Asn 


Glu Gly 


Pro 


Asn 


Ser 


Val 


Gly 


Val 


Ser 


ber 


Glu 


Val 


Ser 


195 








200 










205 








Leu Pro 


Gin 


Phe Lys 


Val 


Leu 


Gly His 


Arg 


Gin Arg 


Ala 


Met 


Glu 


He 


210 








215 










220 










Ser Leu 


Thr 


Thr Gly 


Asn 


Tyr 


Ser 


Arg 


Leu 


Ala 


Cys 


blU 


He 


Gin 


Pne 


225 






230 










235 












Val Arg 


Ser 


Met Gly 

O >l R 


Tyr 


Tyr 


Leu 


He 


Gin 
250 


He 


Tyr 


lie 


Pro 


Ser 
255 


Gly 


Leu lie 


Val 


lie lie 
260 


Ser 


Trp 


Val 


Ser 
265 


Phe 


Trp 


Leu 


Asn 


Arg 

OTA 


Asn 


Ala 


Thr Pro 


Ala 


Arg Val 


Ala 


Leu 


Gly Val 


Thr 


Thr 


Val 


Leu 


Thr 


Met 


Thr 




275 






280 










o Q 








Thr Leu 


Met 


Ser Ser 


Thr 


Asn 


Ala 


Ala 


Leu 


Pro 


Lys 


lie 


Ser 


Tyr 


Val 


290 








295 










300 










Lys Ser 


lie 


Asp Val 


Tyr 


Leu 


Gly Thr Cys 


Phe 


Val 


Me L 


Val 


Phe 


Ala 


one 






310 










315 












Ser Leu 


Leu 


Glu Tyr 


Ala 


Thr 


Val 


Gly Tyr 


Met 


Ala 


Lys 


Arg 


He 


bin 






"3 o c; 










330 










335 




Met Arg 


Lys 


Gin Arg 


Phe 


Val 


Ala 


He 


Gin 


Lys 


He 


Ala 


Ser 


Glu 


Lys 




j4U 








345 










350 






Lys lie 


Pro 


Val Asp 


Cys 


Pro 


Pro 


Val 


Gly 


Asp 


Pro 


His 


Thr 


Leu 


C A V 

oer 


^ R 








360 










365 








Lys Met 


Gly 


Thr Leu 


Gly Arg 


Cys 


Pro 


Pro 


Gly Arg 


Pro 


Ser 


Glu 


vai 


370 








375 










380 










Arg Phe 


Lys 


Val His 


Asp 


Pro 


Lys 


Ala 


His 


Ser 


Lys 


Gly Gly Thr 


Leu 


385 






390 










395 










A nn 

4 \J\J 


Glu Asn 


Thr 


lie Asn 


Gly Gly 


Arg 


Ser Gly 


Ala 


Glu 


Glu 


Glu 


Asn 


Pro 






405 










410 










415 




Gly Pro 


Pro 


Pro His 


He 


Leu 


His 


Pro 


Gly 


Lys 


Asp 


He 


Ser 


Lys 


Leu 




420 








425 










430 






Leu Gly Met 


Thr Pro 


Ser Asp 


He 


Asp 


Lys 


Tyr 


Ser 


Arg 


He 


Val 


Phe 




435 








440 










445 








Pro Val Cys 


Phe Val 


Cys 


Phe 


Asn 


Leu 


Met 


Tyr Trp 


He 


He 


Tyr 


Leu 


450 






455 










4 60 










His Val 


Ser 


Asp Val 


Val 


Ala 


Asp Asp 


Leu 


Val 


Leu 


Leu Glu Glu Asp 


465 




470 










475 










480 


Lys 





























(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 59 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



Ala 


Gly 


Ala 


Gly 


Gly 


Gly 


Gly 


Met 


Pne 


Gly Asp 


vai 


Asn 


T 1 Q 

lie 


oer 


Ala 

/iia 


1 ■ 






5 










1 A 

10 










10 




lie 


Leu 


Asp 


Ser 


Pne 


Ser 


He 


Ser 


Tyr 


Asp 


Lys 


Arg 


vai 


Arg 


jrro 


Asn 






20 










ZD 










JU 






Tyr 


Gly 


Gly 


Pro 


Pro 


Val 


Glu 


Val 


Gly 


Val 


Thr 


Met 


Tyr 


Val 


Leu 


Ser 


35 










/I A 

40 


















He 


Ser 
50 


Ser 


Val 


Ser 


Glu 


Val 
55 


Leu 


Met 


Asp 


Phe 


Thr 

DU 


Leu 


Asp 


Phe 


Tyr 


Phe 


Arg 


Gin 


Phe 


Trp 


Thr 


Asp 


Pro 


Arg 


Leu 


Ala 


Tyr 


Lys 


Lys 


Arg 


1 nr 


65 








70 










75 










oU 


Gly 


Val 


Glu 


Thr 


Leu 


Ser 


Val 


Gly 


Ser 


Glu 


Phe 


He 


Lys 


Asn 


He 


Trp 








85 










90 










yo 




Val 


Pro 


Asp 


Thr 


Phe 


Phe 


Val 


Asn 


Glu 


Lys 


Gin 


Ser 


Tyr 


Phe 


His 


He 






100 










105 










1 1 A 

HO 






Ala 


Thr 


Thr 
115 


Ser 


Asn 


Glu 


Phe 


He 

120 


Arg 


He 


His 


Tyr 


Ser 

125 


(jiy 


Ser 


lie 


Thr 


Arg 
130 


Ser 


He 


Arg 


Leu 


Thr 
135 


He 


Thr 


Ala 


Ser 


Cys 
14 0 


Pro 


Met 


TV _ _ 

Asn 


Leu 


Gin 


Tyr 


Phe 


Pro 


Met 


Asp 


Arg 


Gin 


Leu 


Cys 


His 


T 1 ^ 

He 


blU 


1 le 


blU 


Ser 


145 








150 










155 










1 CA 

loU 


Phe 


Gly 


Tyr 


Thr 


Met 
165 


Arg 


Asp 


He 


Arg 


Tyr 
170 


Lys 


Trp 


TV n n 

Asn 




biy 

175 


Fro 


Asn 


Ser 


Val 


Gly 
180 


Val 


Ser 


Ser 


Glu 


Val 
185 


Ser 


Leu 


Pro 


Gin 


Pne 

1 QA 

lyu 


Lys 


Val 


Leu 


Gly 


His 


Arg 


Gin 


Arg 


Ala 


Met 


Glu 


He 


Ser 


Leu 


Thr 


Thr 


Gly 


Asn 




195 










200 










on 
ZVD 








Tyr 


Ser 


Arg 


Leu 


Ala 


Cys 


Glu 


He 


Gin 


Phe 


Val 


Arg 


Ser 


Meu 


vjiy 


Tyr 


210 








215 










ZZu 










Tyr 


Leu 


He 


Gin 


He 


Tyr 


He 


Pro 


Ser 


Gly 


Leu 


lie 


Val 


T 1 o 

X xe 


X xe 


oer 


225 










O O A 

230 










235 










0 A r\ 


Trp 


Val 


Ser 


Phe 


Trp 


Leu 


Asn 


Arg 


Asn 


Ala 


Thr 


Pro 


Ala 


Arg 


Val 


Ala 








245 










250 










o i; e; 
ZDO 




Leu 


Gly 


Val 


Thr 


Thr 


Val 


Leu 


Thr 


Met 


Thr 


Thr 


Leu 


Met 


Ser 


Ser 


Thr 






260 










o c c 










Z / u 






Asn 


Ala 


Ala 
275 


Leu 


Pro 


Lys 


T 1 « 

He 


i>er 

O O A 


Tyr 


Val 


Lys 


o6r 


0 Q c; 
ZoO 


i-iop 


vox 


lyr 


Leu 


Gly 


Thr 


Cys 


Phe 


Val 


Met 


Val 


Phe 


Ala 


Ser 


Leu 


Leu 


Glu 


Tyr 


Ala 




290 








O Q 




















Thr 


Val 


Gly 


Tyr 


Kyi _ 4. 

Met 


Ala 


Lys 


Arg 


lie 


Gin 


Met 


Arg 


Lys 


m n 

oXIl 


rlx y 


trxic 


"5 c 

305 








olU 










315 












Val 


TV 1 

Ala 


He 


Gin 


Lys 

o o c 

325 


T 1 ^ 

lie 


/\ia 


Ser 


VjlU 


Lys 
330 


Lys 




IT S.KJ 


Vox 






Pro 


Pro 


vai 


b±y 
340 


Asp 


Pro 


n IS 


1 nr 


Leu 

o4 D 


Ser 


Lys 


Met- 
ric I. 


O JL 


Thr 


Leu 


Gly 


Arg 


Cys 


Pro 


Pro 


Gly 


Arg 


Pro 


Ser 


Glu 


Val Arg 


Phe 


Lys 


Val 


His 


Asp 


Job 








JOU 


















Pro 


Lys 

370 


Ala 


His 


Ser 


Lys 


Gly 

J /D 


Gly 


Thr 


Leu 


Glu 


Asn 
Jo u 


Thr 


He 


Asn 


Gly 


Gly 


Arg 


Ser 


Gly 


Ala 


Glu 


Glu 


Glu 


Asn 


Pro Gly 


Pro 


Pro 


Pro 


His 


He 


385 






390 










395 










400 


Leu 


His 


Pro 


Gly 


Lys 


Asp 


He 


Ser 


Lys 


Leu 


Leu 


Gly 


Met 


Thr 


Pro 


Ser 








405 










410 










415 




Asp 


He 


Asp 


Lys 


Tyr 


Ser 


Arg 


He 


Val 


Phe 


Pro 


Val 


Cys 


Phe 


Val 


Cys 






420 










425 










430 






Phe 


Asn 


Leu 
435 


Met 


Tyr 


Trp 


He 


He 
440 


Tyr 


Leu 


His 


Val 


Ser 
445 


Asp 


Val 


Val 


Ala 


Asp 
450 


Asp 


Leu 


Val 


Leu 


Leu 
455 


Glu 


Glu 


Asp 


Lys 
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(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 669 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 2. , .667 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



C TTC GTG AAC GAA AAG CAA TCG TAC TTC CAC ACG GCC ACC ACC AGT AAT 4 9 
Phe Val Asn Glu Lys Gin Ser Tyr Phe His Thr Ala Thr Thr Ser Asn 
15 10 15 

GAG TTC ATC CGC ATC CAC CAC TCG GGC TCC ATC ACG CGT AGC ATA AGG 97 
Glu Phe lie Arg He His His Ser Gly Ser He Thr Arg Ser He Arg 
20 25 30 



CTC ACC ATC ACG GCC TCC TGC CCC ATG AAC CTG CAG TAC TTC CCC ATG 145 
Leu Thr He Thr Ala Ser Cys Pro Met Asn Leu Gin Tyr Phe Pro Met 
35 40 45 

GAT CGG CAG CTG TGC CAC ATC GAG ATC GAG AGT TTC GGC TAC ACC ATG 193 
Asp Arg Gin Leu Cys His He Glu He Glu Ser Phe Gly Tyr Thr Met 
50 55 60 

CGG GAC ATC CGG TAC AAA TGG AAC GAG GGG NCC AAC TCG GTG GGC GTT 241 
Arg Asp He Arg Tyr Lys Trp Asn Glu Gly Xaa Asn Ser Val Gly Val 
65 70 75 80 

TCA AAC GAA GTG TCG CTA CCG CAG TTC AAG GTG TTG GGC CAT CGT CAA 289 
Ser Asn Glu Val Ser Leu Pro Gin Phe Lys Val Leu Gly His Arg Gin 
85 90 95 

CGT GCC ATG GAA ATA TCG CTC ACA ACA GGA AAC TAC TCC CGG CTG GCG 337 
Arg Ala Met Glu He Ser Leu Thr Thr Gly Asn Tyr Ser Arg Leu Ala 
100 105 110 

TGC GAG ATC CAG TTC GTG CGC TCG ATG GGC TAC TAC CTG ATC CAG ATC 385 
Cys Glu He Gin Phe Val Arg Ser Met Gly Tyr Tyr Leu He Gin He 
115 120 125 

TAC ATA CCA TCC GGC CTC ATC GTC ATA ATA TCG TGG GTG TCT TTC TGG 433 
Tyr He Pro Ser Gly Leu He Val He He Ser Trp Val Ser Phe Trp 
130 135 140 

TTG AAC CGC AAC GCG ACG CCG GCG CGC GTG CAG CTG GGC GTC ACC ACC 481 
Leu Asn Arg Asn Ala Thr Pro Ala Arg Val Gin Leu Gly Val Thr Thr 
145 150 155 160 

GTG CTC ACC ATG ACC ACG CTC ATG TCT TCC ACT AAT GCG GCG CTG CCG 529 
Val Leu Thr Met Thr Thr Leu Met Ser Ser Thr Asn Ala Ala Leu Pro 
165 170 175 



AAG ATC TCG TAC GTT AAG TCC ATC GAT GTG TAC CTC GGC ACC TGC TTC 
Lys He Ser Tyr Val Lys Ser He Asp Val Tyr Leu Gly Thr Cys Phe 



577 
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180 185 190 

GTT ATG GTG TTC ACC AGT CTG CTA GAG TAG GCG ACG GTG GGG TAT ATG 625 

Val Met Val Phe Thr Ser Leu Leu Glu Tyr Ala thr Val Gly Tyr Met 

195 200 205 

TCG AAG AGA ATA GAG ATG AGA AAG CAG CGC TTT GTC GCG ATC CC 669 

Ser Lys Arg lie Gin Met Arg Lys Gin Arg Phe Val Ala He 

210 215 220 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 222 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
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210 215 220 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



GCRAANACCA TNACRAARCA 



20 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GTNGTCATNG TSAGNACNGT 20 
(2) INF0Rb4ATI0N FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
TGGGTNCCNG AYACNNT 1 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CCGAGCTCSW RTAYTTRTCD ATRTC 25 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CCGAGCTCAR RTADATDATC CARTACAT 28 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



AGGCGGCCGC GGNGTNACNA TGTAYGT 



27 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
CTGCGGCCGC CARTTYTGGA CNGAYCC 27 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
AATCTAGAGG GTGTCTTTCT GGTTG 25 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
AGCTCGAGAG TTTCGGCTAC ACCAT 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
TTCTCGAGCG ATGGATTTGC ACTATTTTC 

(2) INFORMATION FOR SEQ ID NO: 19: 



29 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CAGAGCTCAT TTCACATGCC AGACGAGAG 2 9 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
TAGAGCTCGA ATGATGAATG CGTATGAAT 29 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
TCTCTAGATA CGCTCGATGG GATAC 

(2) INFORMATION FOR SEQ ID NO:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
TTGCGGCCGC CATATATCCC ACAG 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
CTGCGGCCGC TCGAGCTGGT G 



21 
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(2) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
CGGATGAATT CATTGCTGGT TGTT 24 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
CTGTCGATCC ATCGGGAAGT ATTG 24 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
GCGGACCTCC ATAGTTTGGT C 21 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
CAGACGAAGA AGCTGGACCA CCTC 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



wo 98/49185 





PCT/US98/08563 



-41 - 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 



ACGCGGCCGC AAGGACATAA GCAA 



24 



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
TTCTCGAGCG ATGGATTTGC ACTATTTTC 29 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
AGTCCCGGCG GCAGGCTGAT A 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
ATGACAATTA GGCCAGACGG AATA 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
CATCCGATAC AAGTGGAATG 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 



GTCGACCCAG TGCCAATATA CACGAC 



26 



(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
GTCGACTTAC CGAAACTTGA TGGATG 26 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
TTGCGGCCGC CATATATCCC ACAG 24 
(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
GTGAAATACA ATTCGTTCGG TCTA 24 
(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
GCAGATGTGG AAAAT7VAGTG GATT 24 
(2) INFORMATION FOR SEQ ID NO: 38: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
AGCGAATACC ATGACGAAAC A 21 
(2) INFORMATION FOR SEQ ID NO: 39: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
GCGGCCGCCT TCCTAC 16 
(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
ACCATCCATT TAGACGACGC 20 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:41: 
GTAACACCAA CTTCCACCG 19 
(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
GAATGGCCAA CATGTCGCTG GAAATC 



26 
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(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
AATAATGACG TCACCGAACA TCCCTCCCCC ACCG 34 

The nucleic acid sequences described herein, and consequently the protein 
sequences derived therefrom, have been carefully sequenced. However, those of 
ordinary skill will recognize that nucleic acid sequencing technology can be susceptible 
to some error. Those of ordinary skill in the relevant arts are capable of validating or 

5 correcting these sequences based on the ample description herein of the nucleic acid 

sequences and of methods of isolating these nucleic acid sequences. Thus such corrected 
sequences, and such modifications that are made readily available by the present 
disclosure, are encompassed by the present invention. Furthermore, those sequences 
reported herein are within the invention whether or not later clarifying studies identify 

1 0 sequencing errors. 

The Aedes signal peptide/Heliothis GABA a3 (Ala) chimera described above was 
assembled and cloned into the Mscl/Sall sites of a Novagen pT7Blue-2 Xenopus 
transcription vector placing it under the control of a T7 promoter. This plasmid was 
stored and designated as pT7HGABA-a3. The Heliothis GABA a2 (Ser) isoform was 

15 assembled and cloned into the Pmel/BamHI sites of a Novagen pIEl-3 baculovirus 
expression vector placing it under the control of the iel baculovirus promoter. This 
plasmid was stored and designated as pIE3HGABA-a2. 

While this invention has been described with an emphasis upon preferred 
embodiments, it will be obvious to those of ordinary skill in the art that variations in the 

20 preferred devices and methods may be used and that it is intended that the invention may 
be practiced otherwise than as specifically described herein. Accordingly, this invention 
includes all modifications encompassed within the spirit and scope of the invention as 
defined by the claims that follow. 
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What is claimed: 

1. An isolated nucleic acid encoding a GABA-gated chloride channel 
comprising: 

(a) a nucleic acid including a sequence encoding a protein sequence of 
5 SEQ ID 3, SEQ ID 6 or SEQ ID 8, or a sequence having at least about 85% sequence 

identity with SEQ ID 3, SEQ ID 6 or SEQ ID 8; or 

(b) a nucleic acid that hybridizes with a nucleic acid encoding a protein 
sequence of SEQ ID 3, SEQ ID 6 or SEQ ID 8 or the complementary sequence to SEQ 
ID 3, SEQ ID 6 or SEQ ID 8, under stringent conditions; or 

10 (c) a nucleic acid that hybridizes with a nucleic acid having a sequence of 

SEQ ID 1, SEQ ID 4 or SEQ ID 7 or the complementary sequence to SEQ ID 1, SEQ ID 
4 or SEQ ID 7, under stringent conditions; or 

(d) a nucleic acid has at least about 85 % sequence identity with the 
coding region of SEQ ID 1, SEQ ID 4 or SEQ ID 7, 

15 

2. An expression vector for expressing a GABA-gated chloride channel 
comprising the nucleic acid of claim 1 . 

3. The vector of claim 2, wherein expression of the nucleic acid is driven by an 
20 inducible promoter. 

4. A process of producing a recombinant cell that expresses a recombinant 
GABA-gated chloride channel comprising transforming a cell with the vector of claim 2. 

25 5. A cell comprising a nucleic acid encoding a GABA-gated chloride channel, 

wherein the nucleic acid is functionally associated with a promoter, and wherein the 
nucleic acid: 

(a) includes a protein sequence of SEQ ID 3, SEQ ID 6 or SEQ ID 8, or a 
sequence having at least about 85% sequence identity with SEQ ID 3, SEQ ID 6 or SEQ 
30 IDS; or 
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(b) that hybridizes with a nucleic acid encoding a protein sequence of 
SEQ ID 3, SEQ ID 6 or SEQ ID 8, or a complementary sequence to SEQ ID 3, SEQ ID 6 
or SEQ ID 8, imder stringent conditions; or 

(c) that hybridizes with a nucleic acid having a sequence of SEQ ID 1, 

5 SEQ ID 4 or SEQ ID 7 or the complementary sequence to SEQ ID 1, SEQ ID 4 or SEQ 
ID 7, under stringent conditions; or 

(d) a nucleic acid has at least about 85 % sequence identity with the 
coding region of SEQ ID 1 , SEQ ID 4 or SEQ ID 7. 

10 6. The cell of claim 5, wherein the cell expresses a recombinant GABA-gated 

chloride channel at its cell surface. 



7. A process of producing a GABA-gated chloride channel comprising 
expressing the protein in the cell of claim 5. 

15 

8. The process of claim 7, wherein the promoter is an inducible promoter and the 
process further comprises: 

growing the cell in a medium; and 

inducing the expression of the GABA-gated chloride channel by adding an 
20 inducing agent into the medium. 

9. The method of claim 7 further comprising at least one of (a) isolating 
membranes from said cells, which membranes comprise the GABA-gated chloride 
channel or (b) extracting a protein fraction from the cells which fraction comprises the 

25 GABA-gated chloride channel. 



10. An GABA-gated chloride channel isolated from a cell according to claim 5 
and expressed by the extrinsically-derived nucleic acid 
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1 1 . A method for characterizing a bioactive agent, the method comprising (a) 
providing a first assay composition comprising (i) a cell according to claim 5 or (ii) an 
isolated GABA-gated chloride channel comprising the amino acid sequence encoded by 
the nucleic acid of the vector, or the amino acid sequence resulting from cellular 
5 processing of the amino acid sequence encoded by the nucleic acid of the vector, (b) 
contacting the first assay composition with the bioactive agent or a prospective bioactive 
agent, and (c) measuring the binding of the bioactive agent or prospective bioactive agent 
or a cellular response mediated by a isolated GABA-gated chloride channel. 



10 12. A nucleic acid probe that identifies with the nucleic acid of claim 1, or the 

complementary sequence thereof, under selective conditions. 

13. The nucleic acid probe of claim 12, wherein the nucleic acid is an 
amplification primer and the selective conditions are amplification conditions effective to 

1 5 amplify a GABA-gated chloride channel sequence but not to amplify a GABA-gated 
chloride channel sequence fi-om Drosophila, Aedes, locust or beetle. 

14. An isolated GABA-gated chloride channel including a sequence of SEQ ID 
3, SEQ ID 6 or SEQ ID 8, or a sequence having at least about 85% sequence identity 

20 with SEQ ID 3, SEQ ID 6 or SEQ ID 8. 



15. An isolated GABA-gated chloride channel composition comprising isolated 
membranes in which the GABA-gated chloride channel protein of claim 14 integral 
proteins. 
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C TTC GTG AAC GAA AAG CAA TCG TAG TTC CAC ACG GCC ACC AGO AGT AAT 4 9 
Phe Val Asn Glu Lys Gin Ser Tyr Phe His Thr Ala Thr Thr Ser Asn 
1 5 10 . 15 

GAG TTC ATC CGC ATC CAC CAC TCG GGC TCC ATC ACG CGT AGC ATA AGG 97 
Glu Phe He Arg He His His Ser Gly Ser He Thr Arg Ser He Arg 
20 25 30 

CTC ACC ATC ACG GCC TCC TGC CCC ATG AAC CTG CAG TAG TTC CCC ATG 145 
Leu Thr He Thr Ala Ser Cys Pro Met Asn Leu Gin Tyr Phe Pro Met . 
35 40 45 

GAT CGG CAG CTG TGC CAC ATC GAG ATC GAG AGT TTC GGC TAC ACC ATG 193 
Asp Arg Gin Leu Cys His He Glu He Glu Ser Phe Gly Tyr Thr Met 
50 55 60 

CGG GAC ATC CGG TAC AAA TGG AAC GAG GGG NCC AAC TCG GTG GGC GTT 241 
Arg Asp He Arg Tyr Lys Trp Asn Glu Gly Xaa Asn Ser Val Gly Val 
65 70 75 80 

TCA AAC GAA GTG TCG CTA CCG CAG TTC AAG GTG TTG GGC CAT CGT CAA 289 
Ser Asn Glu Val Ser Leu Pro Gin Phe Lys Val Leu Gly His Arg Gin 
85 90 95 

CGT GCC ATG GAA ATA TCG CTC ACA ACA GGA AAC TAC TCC CGG CTG GCG 337 
Arg Ala Met Glu He Ser Leu Thr Thr Gly Asn Tyr Ser Arg Leu Ala 
100 105 110 

TGC GAG ATC CAG TTC GTG CGC TCG ATG GGC TAC TAC CTG ATC CAG ATC 385 
Cys Glu He Gin Phe Val Arg Ser Met Gly Tyr Tyr Leu He Gin He 
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TAC ATA CCA TCC GGC CTC ATC GTC ATA ATA TCG TGG GTG TCT TTC TGG 433 
Tyr He Pro Ser Gly Leu He Val He He Ser Trp Val Ser Phe Trp 
130 135 140 

TTG AAC CGC AAC GCG ACG CCG GCG CGC GTG CAG CTG GGC GTC ACC ACC 481 
Leu Asn Arg Asn Ala Thr Pro Ala Arg Val Gin Leu Gly Val Thr Thr 
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GTG CTC ACC ATG ACC ACG CTC ATG TCT TCC ACT AAT GCG GCG CTG CCG 529 
Val Leu Thr Met Thr Thr Leu Met Ser Ser Thr Asn Ala Ala Leu Pro 
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AAG ATC TCG TAC GTT AAG TCC ATC GAT GTG TAC CTC GGC ACC TGC TTC 577 
Lys He Ser Tyr Val Lys Ser He Asp Val Tyr Leu Gly Thr Cys Phe 
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GTT ATG GTG TTC ACC AGT CTG CTA GAG TAC GCG ACG GTG GGG TAT ATG 625 
Val Met Val Phe Thr Ser Leu Leu Glu Tyr Ala Thr Val Gly Tyr Met 
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TCG AAG AGA ATA CAG ATG AGA AAG CAG CGC TTT GTC GCG ATC CC 669 
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